Thank you so much for your amazing work! When running the code, we noticed that two VAR models of different scales are directly employed as the drafter and refiner models without additional training. This raises the question: why can these two models be used together seamlessly without further fine-tuning?
Hi @IzumiKDl ,
Thanks for your interest! Although the drafter and refiner are two different VAR models, they were trained with the same VQVAE encoder, so they share a single discrete latent space (codebook). As a result, they can perform collaborative decoding without any additional training. That said, fine-tuning each model on its specialized scales further improves generation quality.
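
To make the idea concrete, here is a minimal sketch of why the shared codebook is what enables training-free collaboration. The `drafter`, `refiner`, and `vqvae` objects and their `predict_scale`/`decode` methods are hypothetical placeholders, not the repo's actual API; this only illustrates the hand-off between the two models.

```python
# Illustrative sketch of training-free collaborative decoding between two VAR models.
# `drafter`, `refiner`, and `vqvae` are hypothetical stand-ins for the real models.

def collaborative_decode(drafter, refiner, vqvae, num_scales, draft_steps, cond):
    """The small drafter predicts the first `draft_steps` (coarse) scales; the
    larger refiner continues on the remaining (fine) scales. This hand-off is
    valid only because both models were trained against the same VQVAE codebook,
    so the drafter's token maps are a legal prefix context for the refiner.
    """
    token_maps = []

    # Coarse scales: cheap drafter model produces the initial token maps.
    for k in range(draft_steps):
        tokens_k = drafter.predict_scale(k, prefix=token_maps, cond=cond)
        token_maps.append(tokens_k)

    # Fine scales: larger refiner model continues from the drafter's tokens.
    for k in range(draft_steps, num_scales):
        tokens_k = refiner.predict_scale(k, prefix=token_maps, cond=cond)
        token_maps.append(tokens_k)

    # The shared VQVAE decoder maps the accumulated token maps back to pixels.
    return vqvae.decode(token_maps)
```

If the two models used different VQVAE codebooks, the drafter's token indices would mean nothing to the refiner, and this hand-off would require retraining or an explicit alignment step.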