DocumentCode
507311
Title
Sample-Based Automatic Dictionary Generation for Keyword Spotting System
Author
Lu, Li ; Ge, Fengpei ; Li, Ta ; Zhao, Qingwei ; Yan, Yonghong
Author_Institution
Inst. of Acoust., Chinese Acad. of Sci. Beijing, Beijing, China
Volume
5
fYear
2009
fDate
14-16 Aug. 2009
Firstpage
505
Lastpage
508
Abstract
In this paper we develop an approach to automatic, data-driven generation of pronunciation dictionaries for keyword spotting(KWS) systems. In practical applications, KWS tasks often have to deal with keywords whose pronunciations can not be found in the dictionary. To solve this problem, we study how to derive pronunciations automatically from speech samples of keywords. Recognized sequences from these samples are used as candidates, and merged to form a phoneme confusion network(PCN) from which the pronunciations are extracted based on a confidence-based metric. Experimental results show that sample-based dictionary reaches similar performance with the canonical dictionary, and the proposed approach is independent of the sample set.
Keywords
dictionaries; speech recognition; canonical dictionary; confidence-based metric; data-driven generation; keyword spotting system; phoneme confusion network; pronunciation dictionary; pronunciations; recognized sequences; sample-based automatic dictionary generation; speech samples; Acoustic measurements; Automatic speech recognition; Dictionaries; Fuzzy systems; Hidden Markov models; Interpolation; Lattices; Merging; Personal communication networks; Speech recognition; Confidence Metric; Data-driven; Phoneme Confusion Network; Pronunciation Extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location
Tianjin
Print_ISBN
978-0-7695-3735-1
Type
conf
DOI
10.1109/FSKD.2009.506
Filename
5360571
Link To Document