• DocumentCode
    2788418
  • Title

    Language model combination and adaptation usingweighted finite state transducers

  • Author

    Liu, X. ; Gales, M.J.F. ; Hieronymus, J.L. ; Woodland, P.C.

  • Author_Institution
    Eng. Dept., Cambridge Univ., Cambridge, UK
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5390
  • Lastpage
    5393
  • Abstract
    In speech recognition systems language model (LMs) are often constructed by training and combining multiple n-gram models. They can be either used to represent different genres or tasks found in diverse text sources, or capture stochastic properties of different linguistic symbol sequences, for example, syllables and words. Unsupervised LM adaptation may also be used to further improve robustness to varying styles or tasks. When using these techniques, extensive software changes are often required. In this paper an alternative and more general approach based on weighted finite state transducers (WFSTs) is investigated for LM combination and adaptation. As it is entirely based on well-defined WFST operations, minimum change to decoding tools is needed. A wide range of LM combination configurations can be flexibly supported. An efficient on-the-fly WFST decoding algorithm is also proposed. Significant error rate gains of 7.3% relative were obtained on a state-of-the-art broadcast audio recognition task using a history dependently adapted multi-level LM modelling both syllable and word sequences.
  • Keywords
    speech recognition; stochastic processes; transducers; different linguistic symbol sequences; language model adaptation; language model combination; speech recognition systems language model; stochastic properties; weighted finite state transducers; Adaptation model; Broadcasting; Decoding; Error analysis; History; Natural languages; Robustness; Speech recognition; Stochastic processes; Transducers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5494941
  • Filename
    5494941