مرکز منطقه ای اطلاع رساني علوم و فناوري - Thai Speech Keyword Spotting using Heterogeneous Acoustic Modeling

DocumentCode :

2753670

Title :

Thai Speech Keyword Spotting using Heterogeneous Acoustic Modeling

Author :

Tangruamsub, Sirinart ; Punyabukkana, Proadpran ; Suchato, Atiwong

Author_Institution :

Spoken Language Syst. Res. group, Chulalongkorn Univ., Bangkok

fYear :

2007

fDate :

5-9 March 2007

Firstpage :

253

Lastpage :

260

Abstract :

This paper illustrates the use of acoustic modeling of three different structures, including syllables, fillers and keywords, for keyword spotting. Filler models and syllable models are applied to capture out-of-vocabulary words, while keyword models extract significant words from speech utterances. Grammatical details are utilized with syllable models to add extra domain constraints. This improves the system´s ability to detect non-keyword vocabularies. Filler models associating with syllable models reduce false alarm of keyword detection. Three kinds of filler models are described. Different types of filler models perform differently in keyword spotting of utterances with only one keyword and ones with multiple keywords. Experiments are conducted on a telephone call transferring via Thai spoken language domain. The proposed method is compared with a limited vocabulary speech recognition and keyword spotting using a reward function. For single- keyword utterances, the best accuracy obtained using the proposed method is approximately 70%, which is better than the ones from LVSR and spotting via reward functions. For multiple-keyword utterances, the best precision and recall rates are 72% and 65%, respectively. These are marginally better than ones obtained from limited vocabulary speech recognition, while typical reward function approach yields the rates of less than 50%.

Keywords :

acoustic signal detection; computational linguistics; grammars; hidden Markov models; speech recognition; Thai speech keyword spotting; filler model; grammar; heterogeneous acoustic modeling; hidden Markov model; keyword model; nonkeyword vocabulary detection; out-of-vocabulary word; speech recognition; speech utterance; syllable model; Acoustical engineering; Application software; Automatic speech recognition; Hidden Markov models; Natural languages; Noise robustness; Speech enhancement; Speech recognition; Telephony; Vocabulary; Acoustic Modeling using Hidden Markov Models; Keyword Spotting from Speech Utterances; Speech Recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Research, Innovation and Vision for the Future, 2007 IEEE International Conference on

Conference_Location :

Hanoi

Print_ISBN :

1-4244-0694-3

Type :

conf

DOI :

10.1109/RIVF.2007.369165

Filename :

4223082

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2753670