• DocumentCode
    698650
  • Title

    Language modeling using Independent Component Analysis for Automatic Speech Recognition

  • Author

    Kumaran, Raghunandan S. ; Narayanan, Karthik ; Gowdy, John N.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Clemson Univ., Clemson, SC, USA
  • fYear
    2005
  • fDate
    4-8 Sept. 2005
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Conventional statistical language models such as N-grams are inadequate to model long distance dependencies in natural language. In this paper we propose a novel statistical language model to capture topic related long range dependencies. Humans have the inherent ability to identify long range dependencies in natural language. Given a set of related words humans can easily identify the context in which the set of words is occurring. It has been shown by many researchers that Independent Component Analysis (ICA) captures these kind of dependencies better than any other formulation. Furthermore, ICA provides a topic decomposition that can be easily interpreted by humans compared to other models. This paper describes the development of a language model using ICA. The topic model is combined with a standard N-gram to produce the language model. The perplexity results obtained show that this language model is a viable language model for speech recognition purposes.
  • Keywords
    independent component analysis; speech recognition; ICA; automatic speech recognition; independent component analysis; language modeling; natural language; statistical language model; topic related long range dependencies; Analytical models; Computational modeling; History; Matrix decomposition; Semantics; Speech recognition; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2005 13th European
  • Conference_Location
    Antalya
  • Print_ISBN
    978-160-4238-21-1
  • Type

    conf

  • Filename
    7078242