• DocumentCode
    1695223
  • Title

    Language model verbalization for automatic speech recognition

  • Author

    Sak, Hasim ; Beaufays, Francoise ; Nakajima, Kensuke ; Allauzen, Cyril

  • Author_Institution
    Google, Mountain View, CA, USA
  • fYear
    2013
  • Firstpage
    8262
  • Lastpage
    8266
  • Abstract
    Transcribing speech in properly formatted written language presents some challenges for automatic speech recognition systems. The difficulty arises from the conversion ambiguity between verbal and written language in both directions. Non-lexical vocabulary items such as numeric entities, dates, times, abbreviations and acronyms are particularly ambiguous. This paper describes a finite-state transducer based approach that improves proper transcription of these entities. The approach involves training a language model in the written language domain, and integrating verbal expansions of vocabulary items as a finite-state model into the decoding graph construction. We build an inverted finite-state transducer to map written vocabulary items to alternate verbal expansions using rewrite rules. Then, this verbalizer transducer is composed with the n-gram language model to obtain a verbalized language model, whose input labels are in the verbal language domain while output labels are in the written language domain. We show that the proposed approach is very effective in improving the recognition accuracy of numeric entities.
  • Keywords
    speech recognition; transducers; vocabulary; automatic speech recognition system; decoding graph construction; inverted finite-state transducer model; language model verbalization; n-gram language model; nonlexical vocabulary; verbal language domain; written language domain; Abstracts; Google; Measurement; Nickel; Numerical models; finite-state transducer; language modeling; speech recognition; verbalization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639276
  • Filename
    6639276