• DocumentCode
    2789670
  • Title

    Language model adaptation using Random Forests

  • Author

    Deoras, Anoop ; Jelinek, Frederick ; Su, Yi

  • Author_Institution
    Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MN, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5198
  • Lastpage
    5201
  • Abstract
    In this paper we investigate random forest based language model adaptation. Large amounts of out-of-domain data are used to grow the decision trees while very small amounts of in-domain data are used to prune them back, so that the structure of the trees are suitable for the desired domain while the probabilities in the tree nodes are reliably estimated. Extensive experiments are carried out and results are reported on a particular task of adapting Broadcast News language model to the MIT computer science lecture domain. We show 0.80% and 0.60% absolute WER improvement over language model interpolation and count merging techniques, respectively.
  • Keywords
    decision trees; natural language processing; speech processing; count merging technique; decision trees; language model adaptation; language model interpolation; random forest; Adaptation model; Broadcasting; Computer science; Decision trees; History; Interpolation; Merging; Natural languages; Speech processing; Training data; Adaptation; Language Modeling; Random Forests;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495012
  • Filename
    5495012