DocumentCode
697998
Title
Filler models for Automatic Speech Recognition created from Hidden Markov Models using the K-Means algorithm
Author
Dunnachie, Matthew E. ; Shields, Paul W. ; Crawford, David H. ; Davies, Mike
Author_Institution
Alba Centre, Inst. for Syst. Level Integration, Livingston, UK
fYear
2009
fDate
24-28 Aug. 2009
Firstpage
544
Lastpage
548
Abstract
In Automatic Speech Recognition (ASR), the presence of Out Of Vocabulary (OOV) words or sounds, within the speech signal, can have a detrimental effect on recognition performance. One common method of solving this problem is to use filler models to absorb the unwanted OOV utterances. A balance between accepting In Vocabulary (IV) words and rejecting OOV words can be achieved by manipulating the values of Word Insertion Penalty and Filler Insertion Penalty. This paper investigates the ability of three different classes of HMM filler models, K-Means, Mean and Baum-Welch, to discriminate between IV and OOV words. The results show that using the Baum-Welch trained HMMs 97.0% accuracy is possible for keyword IV acceptance and OOV rejection. The K-Means filler models provide the highest IV acceptance score of 97.3% but lower overall accuracy. However, the computational complexity of the K-Means algorithm is significantly lower and requires no additional speech training data.
Keywords
computational complexity; hidden Markov models; pattern clustering; speech recognition; vocabulary; word processing; HMM filler model; IV words; K-means algorithm; Mean and Baum-Welch; OOV sound; automatic speech recognition; computational complexity; filler insertion penalty; hidden Markov model; in vocabulary words; out of vocabulary words; unwanted OOV utterances; word insertion penalty; Batteries; Cameras; Clocks; Computers; Hidden Markov models; Microphones; Video equipment;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2009 17th European
Conference_Location
Glasgow
Print_ISBN
978-161-7388-76-7
Type
conf
Filename
7077572
Link To Document