DocumentCode
573174
Title
Developing a hybrid language model for open vocabulary automatic speech recognition in a lecture speech task
Author
Rondeau, Marc-Antoine ; Rose, Richard
Author_Institution
Dept. of Electr. & Comput. Eng., McGill Univ., Montréal, QC, Canada
fYear
2012
fDate
2-5 July 2012
Firstpage
114
Lastpage
119
Abstract
This paper addresses the problem of open vocabulary automatic speech recognition (ASR) using hybrid statistical language models (LMs). Hybrid LMs differ from closed vocabulary LMs in that the word level lexicon is augmented with an inventory of sub-lexical units (SLUs). The procedures used for selecting these SLUs and expanding out-of-vocabulary (OOV) words according to the SLUs is presented in the paper. The open-vocabulary ASR performance obtained using these techniques is presented for a lecture speech task domain.
Keywords
computer aided instruction; natural language processing; speech recognition; statistical analysis; vocabulary; OOV words; SLU; closed vocabulary LM; hybrid statistical language models; lecture speech task; open vocabulary automatic speech recognition; open-vocabulary ASR performance; out-of-vocabulary words; sublexical units; word level lexicon; Accuracy; Entropy; Lattices; Probabilistic logic; Speech; Training; Vocabulary; Speech Recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location
Montreal, QC
Print_ISBN
978-1-4673-0381-1
Electronic_ISBN
978-1-4673-0380-4
Type
conf
DOI
10.1109/ISSPA.2012.6310464
Filename
6310464
Link To Document