Title :
Language modeling using Independent Component Analysis for Automatic Speech Recognition
Author :
Kumaran, Raghunandan S. ; Narayanan, Karthik ; Gowdy, John N.
Author_Institution :
Dept. of Electr. & Comput. Eng., Clemson Univ., Clemson, SC, USA
Abstract :
Conventional statistical language models such as N-grams are inadequate to model long distance dependencies in natural language. In this paper we propose a novel statistical language model to capture topic related long range dependencies. Humans have the inherent ability to identify long range dependencies in natural language. Given a set of related words humans can easily identify the context in which the set of words is occurring. It has been shown by many researchers that Independent Component Analysis (ICA) captures these kind of dependencies better than any other formulation. Furthermore, ICA provides a topic decomposition that can be easily interpreted by humans compared to other models. This paper describes the development of a language model using ICA. The topic model is combined with a standard N-gram to produce the language model. The perplexity results obtained show that this language model is a viable language model for speech recognition purposes.
Keywords :
independent component analysis; speech recognition; ICA; automatic speech recognition; independent component analysis; language modeling; natural language; statistical language model; topic related long range dependencies; Analytical models; Computational modeling; History; Matrix decomposition; Semantics; Speech recognition; Vectors;
Conference_Titel :
Signal Processing Conference, 2005 13th European
Conference_Location :
Antalya
Print_ISBN :
978-160-4238-21-1