DocumentCode
698650
Title
Language modeling using Independent Component Analysis for Automatic Speech Recognition
Author
Kumaran, Raghunandan S. ; Narayanan, Karthik ; Gowdy, John N.
Author_Institution
Dept. of Electr. & Comput. Eng., Clemson Univ., Clemson, SC, USA
fYear
2005
fDate
4-8 Sept. 2005
Firstpage
1
Lastpage
4
Abstract
Conventional statistical language models such as N-grams are inadequate to model long distance dependencies in natural language. In this paper we propose a novel statistical language model to capture topic related long range dependencies. Humans have the inherent ability to identify long range dependencies in natural language. Given a set of related words humans can easily identify the context in which the set of words is occurring. It has been shown by many researchers that Independent Component Analysis (ICA) captures these kind of dependencies better than any other formulation. Furthermore, ICA provides a topic decomposition that can be easily interpreted by humans compared to other models. This paper describes the development of a language model using ICA. The topic model is combined with a standard N-gram to produce the language model. The perplexity results obtained show that this language model is a viable language model for speech recognition purposes.
Keywords
independent component analysis; speech recognition; ICA; automatic speech recognition; independent component analysis; language modeling; natural language; statistical language model; topic related long range dependencies; Analytical models; Computational modeling; History; Matrix decomposition; Semantics; Speech recognition; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2005 13th European
Conference_Location
Antalya
Print_ISBN
978-160-4238-21-1
Type
conf
Filename
7078242
Link To Document