Chinese support #9

MonolithFoundation · 2024-12-13T06:27:28Z

would consider support Chinese?

cantabile-kwok · 2024-12-14T13:53:16Z

That is a good question. We want to see how this model works on Chinese, but the core problem is not about model or dataset; it is about the speech tokens. Since in the paper we use vq-wav2vec, which is only trained on English Librispeech corpus, we don't expect it to generalize very well to Chinese. We need to find another token which contains limited timbre information and enough prosody information for Chinese, which seems a bit hard. Training a vq-wav2vec on Chinese dataset is also a larger project. Hence, we would not train this on Chinese unless there is a satisfactory speech token ready to use.

Nevertheless, the language restriction is only on the source speech. For the target reference, any language is feasible (i.e. no problem from English content to Chinese speaker).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chinese support #9

Chinese support #9

MonolithFoundation commented Dec 13, 2024

cantabile-kwok commented Dec 14, 2024

Chinese support #9

Chinese support #9

Comments

MonolithFoundation commented Dec 13, 2024

cantabile-kwok commented Dec 14, 2024