DocumentCode
672330
Title
Efficient nearly error-less LVCSR decoding based on incremental forward and backward passes
Author
Nolden, David ; Schluter, Ralf ; Ney, Hermann
Author_Institution
Human Language Technol. & Pattern Recognition Group, RWTH Aachen Univ., Aachen, Germany
fYear
2013
fDate
8-12 Dec. 2013
Firstpage
66
Lastpage
71
Abstract
We show that most search errors can be identified by aligning the results of a symmetric forward and backward decoding pass. Based on this knowledge, we introduce an efficient high-level decoding architecture which yields virtually no search errors, and requires virtually no manual tuning. We perform an initial forward- and backward decoding with tight initial beams, then we identify search errors, and then we recursively increment the beam sizes and perform new forward and backward decodings for erroneous intervals until no more search errors are detected. Consequently, each utterance and even each single word is decoded with the smallest beam size required to decode it correctly. On all tested systems we achieve an error rate equal or very close to classical decoding with ideally tuned beam size, but unsupervisedly without specific tuning, and at around 2 times faster runtime. An additional speedup by factor 2 can be achieved by decoding the forward and backward pass in separate threads.
Keywords
decoding; speech coding; speech recognition; error-less LVCSR decoding; incremental backward passes; incremental forward passes; large vocavulary continuous speech recognition; symmetric backward decoding pass; symmetric forward decoding pass; Acoustic beams; Acoustics; Context; Decoding; Error analysis; Hidden Markov models; Runtime; LVCSR; decoding; error detection; error-less; pruning; search;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location
Olomouc
Type
conf
DOI
10.1109/ASRU.2013.6707707
Filename
6707707
Link To Document