DocumentCode
3162564
Title
A comparison of dynamic WFST decoding approaches
Author
Dixon, Paul R. ; Hori, Chiori ; Kashioka, Hideki
Author_Institution
Nat. Inst. of Inf. & Commun. Technol., Kyoto, Japan
fYear
2012
fDate
25-30 March 2012
Firstpage
4209
Lastpage
4212
Abstract
In this paper we perform a comparison of lookahead composition and on-the-fly hypothesis rescoring using a common decoder. The results on a large vocabulary speech recognition task illustrate the differences in the behaviour of these algorithms in terms of error rate, real time factor, memory usage and internal statistics of the decoder. The evaluations were performed when the decoder was operated at either the state or arc level. The results show the dynamic approaches also work well at the state level even though there is greater dynamic construction cost.
Keywords
error statistics; speech coding; speech recognition; arc level; decoder; dynamic WFST decoding; dynamic construction cost; error rate; internal statistics; large vocabulary speech recognition task; lookahead composition; memory usage; on-the-fly hypothesis rescoring; real time factor; state level; weighted finite state transducer; Acoustic beams; Acoustics; Decoding; Heuristic algorithms; Speech recognition; Transducers; Vocabulary; Speech recognition; WFST; on-the-fly composition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288847
Filename
6288847
Link To Document