Title :
Thai Speech Keyword Spotting using Heterogeneous Acoustic Modeling
Author :
Tangruamsub, Sirinart ; Punyabukkana, Proadpran ; Suchato, Atiwong
Author_Institution :
Spoken Language Syst. Res. group, Chulalongkorn Univ., Bangkok
Abstract :
This paper illustrates the use of acoustic modeling of three different structures, including syllables, fillers and keywords, for keyword spotting. Filler models and syllable models are applied to capture out-of-vocabulary words, while keyword models extract significant words from speech utterances. Grammatical details are utilized with syllable models to add extra domain constraints. This improves the system´s ability to detect non-keyword vocabularies. Filler models associating with syllable models reduce false alarm of keyword detection. Three kinds of filler models are described. Different types of filler models perform differently in keyword spotting of utterances with only one keyword and ones with multiple keywords. Experiments are conducted on a telephone call transferring via Thai spoken language domain. The proposed method is compared with a limited vocabulary speech recognition and keyword spotting using a reward function. For single- keyword utterances, the best accuracy obtained using the proposed method is approximately 70%, which is better than the ones from LVSR and spotting via reward functions. For multiple-keyword utterances, the best precision and recall rates are 72% and 65%, respectively. These are marginally better than ones obtained from limited vocabulary speech recognition, while typical reward function approach yields the rates of less than 50%.
Keywords :
acoustic signal detection; computational linguistics; grammars; hidden Markov models; speech recognition; Thai speech keyword spotting; filler model; grammar; heterogeneous acoustic modeling; hidden Markov model; keyword model; nonkeyword vocabulary detection; out-of-vocabulary word; speech recognition; speech utterance; syllable model; Acoustical engineering; Application software; Automatic speech recognition; Hidden Markov models; Natural languages; Noise robustness; Speech enhancement; Speech recognition; Telephony; Vocabulary; Acoustic Modeling using Hidden Markov Models; Keyword Spotting from Speech Utterances; Speech Recognition;
Conference_Titel :
Research, Innovation and Vision for the Future, 2007 IEEE International Conference on
Conference_Location :
Hanoi
Print_ISBN :
1-4244-0694-3
DOI :
10.1109/RIVF.2007.369165