DocumentCode :
3485225
Title :
Efficient discriminative training of long-span language models
Author :
Rastrow, Ariya ; Dredze, Mark ; Khudanpur, Sanjeev
Author_Institution :
Human Language Technol. Center of Excellence, Johns Hopkins Univ., Baltimore, MD, USA
fYear :
2011
fDate :
11-15 Dec. 2011
Firstpage :
214
Lastpage :
219
Abstract :
Long-span language models, such as those involving syntactic dependencies, produce more coherent text than their n-gram counterparts. However, evaluating the large number of sentence-hypotheses in a packed representation such as an ASR lattice is intractable under such long-span models both during decoding and discriminative training. The accepted compromise is to rescore only the N-best hypotheses in the lattice using the long-span LM. We present discriminative hill climbing, an efficient and effective discriminative training procedure for long-span LMs based on a hill climbing rescoring algorithm [1]. We empirically demonstrate significant computational savings as well as error-rate reduction over N-best training methods in a state of the art ASR system for Broadcast News transcription.
Keywords :
decoding; learning (artificial intelligence); natural language processing; speech recognition; ASR lattice; ASR system; automatic speech recognition; coherent text; decoding; discriminative hill climbing; discriminative training; hill climbing rescoring algorithm; long span language models; sentence hypotheses; syntactic dependency; Acoustics; Complexity theory; Feature extraction; Lattices; Speech; Syntactics; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
Type :
conf
DOI :
10.1109/ASRU.2011.6163933
Filename :
6163933
Link To Document :
بازگشت