Getting HTK
Documentation
Mailing Lists
Development
Links
|
Proceedings ICASSP94, pages 125 - 128
Large Vocabulary Continuous Speech Recognition Using
HTK
P.C. Woodland, J.J. Odell, V. Valtchev and Steve J. Young
HTK is a portable software toolkit for building speech recognition
systems using continuous density hidden Markov models developed by the
Cambridge University Speech Group. One particularly successful type of
system uses mixture density tied-state triphones. Recently we have
used this technique for the 5k/20k word ARPA Wall Street Journal (WSJ)
task. We have extended our approach from using word-internal gender
independent modelling to use decision tree based state clustering,
cross-word triphones and gender dependent models. Our current systems
can be run with either bigram or trigram language models using a
single pass dynamic network decoder. Systems based on these techniques
were included in the November 1993 ARPA WSJ evaluation, and gave the
lowest error rate reported on the 5k word bigram, 5k word trigram and
20k word bigram "hub" tests and the second lowest error rate on the
20k word trigram "hub" test.
compress'd PS
|