This release fixes a few long-standing bugs in SphinxTrain and makes the package (hopefully) easier to use. Among other things:
- The dependency on SphinxBase is gone, because SphinxBase is gone
- The dependency on Sphinx3 for VTLN and force-alignment is gone (`sphinx3_align` is included)
- Multi-CPU training actually works, tested on up to 64 CPUs with LibriSpeech, much easier than setting up PBS on the Clown
- The dependency on Visual Studio for building on Windows is gone (but please just use WSL, please)
- The dependency on Autotools is gone (CMake ain't great but it's much less bad)
- There is a Dockerfile now
- There is "continuous integration" now (sort of)
- The `-remove_silence` option has been disabled by default (unlike in PocketSphinx you can still turn it on if you really want to, it might save you a bit of time in training)
- It is not necessary to install SphinxTrain system-wide to run training
- G2P support has been updated for the most commonly installed version of OpenFST (do not try to use any other version, because C++, that's why)
What's Changed
- Pick Decoding Model Based on Context Dependence by @Mazyod in #19
- Add includes and fix number of params for bio call by @lenzo-ka in #21
- Output an error message when we cannot execute a tool by @cshung in #10
- Python3 by @dhdaines in #22
- update kal email by @lenzo-ka in #24
- Make an option in config for not folding case in phonemes by @lenzo-ka in #26
- Use consistent shebang for python by @acgrobman in #28
- Include SphinxBase code in SphinxTrain by @dhdaines in #29
- Add -sox flag to sphinx_fe to convert files with SoX by @dhdaines in #30
- Update and enable G2P code by @dhdaines in #39
- Fix stalled Queue::POSIX training by @dhdaines in #40
- Minimally fix MLLT by @dhdaines in #41
- Librispeech training template by @dhdaines in #31
- Switch build to CMake and fix a few bugs by @dhdaines in #45
- Include sphinx3_align by @dhdaines in #46
New Contributors
- @Mazyod made their first contribution in #19
- @lenzo-ka made their first contribution in #21
- @cshung made their first contribution in #10
- @dhdaines made their first contribution in #22
- @acgrobman made their first contribution in #28
Full Changelog: https://github.com/cmusphinx/sphinxtrain/commits/v5.0.0