Skip to content

EC2 Installation Walkthrough

edobashira edited this page Oct 24, 2014 · 22 revisions

Introduction

In this guide I will explain how to setup OpenDcd with Kaldi on EC and decode open source models based on Librispeech corpus. For this walkthrough I used a large instance with four cores and 15GB of memory. OpenDcd is very memory efficient for both decoding and graph construction and this is easily enough to decode the large 4-gram model.

Machine Configure and Setup

   sudo add-apt-repository ppa:ubuntu-toolchain-r/test
   sudo apt-get update
   sudo apt-get install -y gcc-4.9 g++-4.9 cpp-4.9 subversion make zlib1g-dev automake libtool autoconf libatlas3-base

Due a bug in gcc 4.8 we installed gcc 4.9 and set hard links as the system default

   sudo ln -s /usr/bin/g++-4.9 /usr/bin/g++
   sudo ln -s /usr/bin/gcc-4.9 /usr/bin/gcc
   sudo ln -s -f bash /bin/sh

Install Kaldi

   svn co https://svn.code.sf.net/p/kaldi/code/trunk kaldi
   cd kaldi/tools
   make
   cd ../src
   ./configure

For descent runtime performance it is essential to edit the kaldi.mk file and add the -O2 switch. Now just type make to build the Kaldi and optionally specify the number of cores.

  make -j4

Install OpenDcd

   git clone https://github.com/edobashira/opendcd.git
   cd opendcd/3rdparty
   make
   cd ../src/bin
   make -j4

Graph Construction

There are two graph construction methods, in the first we take a set of Kaldi component transducers as the input to the build process. In the second method we take raw language model and lexicon and build everything from scratch. In this recipe we will use the pre-built models from kaldi-asr.org and use the first method.

Download Models

Build Cascade

Decode

Default decoding command

nnet-latgen-faster \
--minimize=false \
--max-active=7000 \
--beam=15.0 \
--lattice-beam=8.0 \
--acoustic-scale=0.1 \
--allow-partial=true \
--word-symbol-table=exp/tri6b/graph_tgsmall/words.txt \
exp/nnet2_online/nnet_a/final.mdl \
exp/tri6b/graph_tgsmall/HCLG.fst \

Default Kaldi feature extraction command

ark,s,cs:apply-cmvn --norm-means=false --norm-vars=false \
  --utt2spk=ark:data/dev_clean_hires/split30/1/utt2spk \
  scp:data/dev_clean_hires/split30/1/cmvn.scp \
  scp:data/dev_clean_hires/split30/1/feats.scp ark:- | \
  paste-feats --length-tolerance=10 ark:- \
  'ark,s,cs:utils/filter_scp.pl data/dev_clean_hires/split30/1/utt2spk \
  exp/nnet2_online/ivectors_dev_clean/ivector_online.scp | \
  subsample-feats --n=-10 scp:- ark:- |'
Clone this wiki locally