training#
Functions
re-order teacher output tokens so that it aligns with student output tokens with greedy search |
|
calculate the distance between student output tokens and teacher output tokens |
|
Functions
re-order teacher output tokens so that it aligns with student output tokens with greedy search |
|
calculate the distance between student output tokens and teacher output tokens |
|