
Distillation is broken #272

Closed
Tracked by #369 ...
eu9ene opened this issue Nov 20, 2023 · 4 comments · Fixed by #308
Assignees: eu9ene
Labels: bug (Something is broken or not correct), quality (Improving robustness and translation quality)

Comments

eu9ene (Collaborator) commented Nov 20, 2023

[Screenshot 2023-11-20 at 3:43:15 PM]

The hypothesis, also discussed in #161, is that distillation might not work as expected with on-the-fly augmentation, since the student is supposed to be trained on the exact outputs of the teacher. The gap between the teacher and student models is far too large. See also #231. I'll first try disabling on-the-fly augmentation for the student model to see how much it helps.

If that's the case, the proper fix would be to augment the corpus before decoding by the teachers.
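A rough illustration of the hypothesis, using toy data and a toy modifier (not the actual OpusTrainer configuration): if the source side is augmented at student-training time, the target is still the teacher's translation of the *unaugmented* sentence, so the pairs are no longer exact (input, teacher output) matches.

```python
# Toy sketch of why on-the-fly augmentation can break sequence-level
# distillation. All data and the modifier below are hypothetical.

def augment_uppercase(sentence: str) -> str:
    """Toy stand-in for an OpusTrainer-style source modifier."""
    return sentence.upper()

# Teacher translations were decoded from the *clean* source corpus.
corpus = [
    ("how are you", "wie geht es dir"),  # (source, teacher output)
    ("good morning", "guten morgen"),
]

# On-the-fly augmentation modifies only the source at training time, so the
# student sees an input the teacher never translated.
broken_pairs = [(augment_uppercase(src), tgt) for src, tgt in corpus]
# broken_pairs[0] is ("HOW ARE YOU", "wie geht es dir"): the target no longer
# corresponds exactly to the source the student is trained on.
```

Augmenting the corpus *before* teacher decoding would instead produce teacher outputs for the augmented sources, keeping the pairs exact.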

@eu9ene added the bug and quality labels on Nov 20, 2023
@eu9ene self-assigned this on Nov 20, 2023
eu9ene (Collaborator, Author) commented Nov 22, 2023

I removed the augmentation; the new student model is still training: https://firefox-ci-tc.services.mozilla.com/tasks/eyHs_KRMRbONpKLXIgj2fA/runs/1

eu9ene (Collaborator, Author) commented Dec 13, 2023

Retraining with augmentation disabled didn't help; the results are almost the same. There's likely a bug somewhere else.

[Screenshot 2023-12-12 at 4:04:59 PM]

eu9ene (Collaborator, Author) commented Dec 14, 2023

It seems the issue is in the merged training corpus: source and target sentences don't match in the output of the merge-translated step. See https://firefox-ci-tc.services.mozilla.com/tasks/QlmjiyNmTJ-fzGGM520NMQ/runs/0
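A simple way to catch this class of bug is a line-alignment sanity check on the merged corpus. The sketch below is my own, not part of the pipeline; it flags the two most common symptoms of a broken merge: differing line counts and lines that are empty on only one side.

```python
# Hypothetical sanity check for a merged parallel corpus: the source and
# target sides must have the same number of lines and stay pairwise aligned.

def check_parallel(src_lines: list[str], trg_lines: list[str]) -> int:
    """Raise ValueError if the two sides are obviously misaligned;
    return the line count otherwise."""
    if len(src_lines) != len(trg_lines):
        raise ValueError(
            f"line count mismatch: {len(src_lines)} source vs "
            f"{len(trg_lines)} target"
        )
    for i, (src, trg) in enumerate(zip(src_lines, trg_lines)):
        # A line that is empty on one side only usually means the files
        # drifted out of alignment at or before this point.
        if bool(src.strip()) != bool(trg.strip()):
            raise ValueError(f"empty/non-empty mismatch at line {i}")
    return len(src_lines)
```

Usage: read each side with `Path(...).read_text(encoding="utf-8").splitlines()` and pass the two lists; a clean corpus returns its line count, a broken one raises with the first offending line.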

@eu9ene changed the title from "Distillation is broken after integrating OpusTrainer" to "Distillation is broken" on Dec 18, 2023
eu9ene (Collaborator, Author) commented Dec 19, 2023

It's supposed to be fixed now. Retesting here: https://firefox-ci-tc.services.mozilla.com/tasks/groups/V-OmRM1yS_GwESTVDPl0VQ
