Skip to content

PaddleSpeech r1.5.0

Latest
Compare
Choose a tag to compare
@zxcd zxcd released this 05 Mar 07:11
9c01a0b

Highlight

New Features

  • Add AudioTools toolkit support used in DAC (Descript-Audio-Codec) training and inference.
  • Reproduce the losses required for DAC model: MultiScaleSTFTLoss, GANLoss, and SISDRLoss.

Version Adaptation

Upgrade and adapt PaddleSpeech from Paddle 2.5.1 to Paddle 3.0.0-beta. Address incompatibility issues caused by the new version upgrade of Paddle, perform adaptation development and regression testing on the models in PaddleSpeech, and ensure the suite operates normally without loss of model functionality or accuracy.

  • Ensure the adaptation of 80+ existing models in the demo and example directories.
  • Ensure the adaptation and accuracy alignment of 10+ core models in the example directory.
  • Support the re-export of 20+ dynamic-to-static models using the PIR + predictor approach and ensure successful inference.

More Detail

New Features

Version Adaptation

Installation Adaptation

Hardware Support

Docs

Bug Fix

CI

Acknowledgements

Special thanks to contributors including @wanx7130, @warrentdrew, @DrRyanHuang, @cchenhaifeng, @undefined-ux, @zxcd, @GreatV, @yinfan98, @Liyulingyue, @megemini, @SuiYunsy, @Netrvin, @enkilee, @tianshuo78520a, @guspan-tanadi, @co63oc and others for their support.

New Contributors