DocumentCode :
1695223
Title :
Language model verbalization for automatic speech recognition
Author :
Sak, Hasim ; Beaufays, Francoise ; Nakajima, Kensuke ; Allauzen, Cyril
Author_Institution :
Google, Mountain View, CA, USA
fYear :
2013
Firstpage :
8262
Lastpage :
8266
Abstract :
Transcribing speech in properly formatted written language presents some challenges for automatic speech recognition systems. The difficulty arises from the conversion ambiguity between verbal and written language in both directions. Non-lexical vocabulary items such as numeric entities, dates, times, abbreviations and acronyms are particularly ambiguous. This paper describes a finite-state transducer based approach that improves proper transcription of these entities. The approach involves training a language model in the written language domain, and integrating verbal expansions of vocabulary items as a finite-state model into the decoding graph construction. We build an inverted finite-state transducer to map written vocabulary items to alternate verbal expansions using rewrite rules. Then, this verbalizer transducer is composed with the n-gram language model to obtain a verbalized language model, whose input labels are in the verbal language domain while output labels are in the written language domain. We show that the proposed approach is very effective in improving the recognition accuracy of numeric entities.
Keywords :
speech recognition; transducers; vocabulary; automatic speech recognition system; decoding graph construction; inverted finite-state transducer model; language model verbalization; n-gram language model; nonlexical vocabulary; verbal language domain; written language domain; Abstracts; Google; Measurement; Nickel; Numerical models; finite-state transducer; language modeling; speech recognition; verbalization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639276
Filename :
6639276
Link To Document :
بازگشت