diffusion-based model results #5

yl4579 · 2024-09-06T05:45:36Z

Great work! Have you tested the performance of this codec on diffusion-based models such as SimpleTTS or DiTTo-TTS?

zhenye234 · 2024-09-07T12:34:34Z

Thank you very much for your question! I have not tested this codec with diffusion-based models such as SimpleTTS or DiTTo-TTS. However, I believe investigating which representations—such as mel, codec latent, or semantic—are better suited for audio diffusion generation could yield valuable insights. Thank you once again for your thoughtful inquiry.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

diffusion-based model results #5

diffusion-based model results #5

yl4579 commented Sep 6, 2024

zhenye234 commented Sep 7, 2024

diffusion-based model results #5

diffusion-based model results #5

Comments

yl4579 commented Sep 6, 2024

zhenye234 commented Sep 7, 2024