DocumentCode
1695223
Title
Language model verbalization for automatic speech recognition
Author
Sak, Hasim ; Beaufays, Francoise ; Nakajima, Kensuke ; Allauzen, Cyril
Author_Institution
Google, Mountain View, CA, USA
fYear
2013
Firstpage
8262
Lastpage
8266
Abstract
Transcribing speech in properly formatted written language presents some challenges for automatic speech recognition systems. The difficulty arises from the conversion ambiguity between verbal and written language in both directions. Non-lexical vocabulary items such as numeric entities, dates, times, abbreviations and acronyms are particularly ambiguous. This paper describes a finite-state transducer based approach that improves proper transcription of these entities. The approach involves training a language model in the written language domain, and integrating verbal expansions of vocabulary items as a finite-state model into the decoding graph construction. We build an inverted finite-state transducer to map written vocabulary items to alternate verbal expansions using rewrite rules. Then, this verbalizer transducer is composed with the n-gram language model to obtain a verbalized language model, whose input labels are in the verbal language domain while output labels are in the written language domain. We show that the proposed approach is very effective in improving the recognition accuracy of numeric entities.
Keywords
speech recognition; transducers; vocabulary; automatic speech recognition system; decoding graph construction; inverted finite-state transducer model; language model verbalization; n-gram language model; nonlexical vocabulary; verbal language domain; written language domain; Abstracts; Google; Measurement; Nickel; Numerical models; finite-state transducer; language modeling; speech recognition; verbalization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639276
Filename
6639276
Link To Document