• DocumentCode
    394235
  • Title

    Language model adaptation using WFST-based speaking-style translation

  • Author

    Hori, Takaaki ; Willett, Daniel ; Minami, Yasuhiro

  • Author_Institution
    Speech Open Lab., NTT Commun. Sci. Labs., Kyoto, Japan
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper describes a new approach to language model adaptation for speech recognition based on the statistical framework of speech translation. The main idea of this approach is to compose a weighted finite-state transducer (WFST) that translates sentence styles from in-domain to out-of-domain. It enables to integrate language models of different styles of speaking or dialects and even of different vocabularies. The WFST is built by combining in-domain and out-of-domain models through the translation, while each model and the translation itself is expressed as a WFST. We apply this technique to building language models for spontaneous speech recognition using large written-style corpora. We conducted experiments on a 20k-word Japanese spontaneous speech recognition task. With a small in-domain corpus, a 2.9% absolute improvement in word error rate is achieved over the in-domain model.
  • Keywords
    linguistics; natural languages; speech recognition; Japanese spontaneous speech recognition task; WFST; dialects; in-domain models; language model adaptation; out-of-domain models; sentence styles; speech recognition; speech translation; statistical framework; vocabularies; weighted finite-state transducer; Acoustic transducers; Adaptation model; Context modeling; Interpolation; Laboratories; Natural languages; Speech recognition; Statistics; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198759
  • Filename
    1198759