Add document for Mandarin model. #364

pkuyym · 2017-10-11T14:50:27Z

xinghai-sun

Could you please provide some tips for Mandarin training, especially for those might be different from English training. e.g. data preparation, language model configuration?

xinghai-sun · 2017-10-12T04:23:36Z

deep_speech_2/README.md

@@ -398,7 +398,7 @@ For more information about the DeepSpeech2 training on PaddleCloud, please refer

 ## Training for Mandarin Language

-TODO: to be added
+The steps of training, evaluation and inference for Mandarin ASR model is same with English ASR model. We have provided an example for Mandarin data which using Aishell dataset and you can find it in ```examples/aishell```. As mentioned above, you can execute ```sh run_data.sh```, ```sh run_train.sh```, ```sh run_test.sh``` and ```sh run_infer.sh``` to do data preparation, training, test and inference correspondingly. We have also tuned a setting to get better model performance (not the best), and you can execute ```sh run_infer_golden.sh``` to show some speech-to-text decoding results.


is same with --> is the same to

for Mandarin data which using Aishell dataset and you cdan find it in --> for Mandarin training with Aishell in

you can execute --> please execute

test --> testing

We have also tuned a setting to get better model performance .... ---> We have also prepared a pre-trained model (downloaded in ./models/aishell/download_model.sh) for users to try with sh run_infer_golden.sh and sh run_test_golden.sh.

xinghai-sun

Almost LGTM.

xinghai-sun · 2017-11-03T14:43:59Z

deep_speech_2/README.md

@@ -398,7 +398,7 @@ For more information about the DeepSpeech2 training on PaddleCloud, please refer

 ## Training for Mandarin Language

-TODO: to be added
+The key steps of training for Mandarin Language are same to that of English Language and we have also provided an example for Mandarin training with Aishell in ```examples/aishell```. As mentioned above, please execute ```sh run_data.sh```, ```sh run_train.sh```, ```sh run_test.sh``` and ```sh run_infer.sh``` to do data preparation, training, test and inference correspondingly. We have also prepared a pre-trained model (downloaded by ./models/aishell/download_model.sh) for users to try with ```sh run_infer_golden.sh``` and ```sh run_test_golden.sh```. Notice that, different from English LM, the Mandarin LM is character based and please run ```tools/tune.py``` to find an optimal setting.


Language --> language
test --> testing
character based --> character-based

Add document for Mandarin model.

d457bcc

pkuyym requested a review from xinghai-sun October 11, 2017 14:50

xinghai-sun requested changes Oct 12, 2017

View reviewed changes

pkuyym added 2 commits November 3, 2017 15:50

Refine doc.

f5c8e18

Refine doc for Mandarin training.

6b43e22

xinghai-sun approved these changes Nov 3, 2017

View reviewed changes

Refine doc.

3ffa9e9

pkuyym merged commit bab3be4 into PaddlePaddle:develop Nov 3, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add document for Mandarin model. #364

Add document for Mandarin model. #364

pkuyym commented Oct 11, 2017 •

edited

Loading

xinghai-sun left a comment •

edited

Loading

xinghai-sun Oct 12, 2017

xinghai-sun left a comment

xinghai-sun Nov 3, 2017

Add document for Mandarin model. #364

Add document for Mandarin model. #364

Conversation

pkuyym commented Oct 11, 2017 • edited Loading

xinghai-sun left a comment • edited Loading

Choose a reason for hiding this comment

xinghai-sun Oct 12, 2017

Choose a reason for hiding this comment

xinghai-sun left a comment

Choose a reason for hiding this comment

xinghai-sun Nov 3, 2017

Choose a reason for hiding this comment

pkuyym commented Oct 11, 2017 •

edited

Loading

xinghai-sun left a comment •

edited

Loading