Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Grad-TTS

Official implementation of the Grad-TTS model, based on diffusion probabilistic modelling. For full details, see our paper, accepted to ICML 2021.

Authors: Vadim Popov*, Ivan Vovk*, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov.

*Equal contribution.

SPIRAL

Official implementation of SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training. For full details, see our paper, accepted to ICLR 2022.

Authors: Wenyong Huang, Zhenhe Zhang, Yu Ting Yeung, Xin Jiang, Qun Liu.

DiffVC

Official implementation of the paper "Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme" (ICLR 2022, Oral).

Authors: Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei.
