Skip to content

jdrex/ctcdecode

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ctcdecode

ctcdecode is an implementation of CTC (Connectionist Temporal Classification) beam search decoding for PyTorch. C++ code borrowed liberally from Paddle Paddles' DeepSpeech. It includes swappable scorer support enabling standard beam search, and KenLM-based decoding.

This fork implements the subword prefix beam search decoding algorithm described in:

Drexler, Jennifer, and James Glass. "Subword Regularization and Beam Search Decoding for End-to-end Automatic Speech Recognition." IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019.

Installation

The library is largely self-contained and requires only PyTorch and CFFI. Building the C++ library requires gcc or clang. KenLM language modeling support is also optionally included, and enabled by default.

# get the code
git clone --recursive https://github.com/parlance/ctcdecode.git
cd ctcdecode
pip install .

About

PyTorch CTC Decoder bindings

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C++ 80.8%
  • Python 17.0%
  • C 2.2%