DocumentCode
394349
Title
Pronunciation variation speech recognition without dictionary modification on sparse database
Author
Kanokphara, Supphanat ; Tesprasit, Virongrong ; Thongprasirt, Rachod
Author_Institution
Inf. R&D Div., Nat. Electron. & Comput. Technol. Center (NECTEC), Pathumthani, Thailand
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
Generally, a speech recognition system uses a fixed set of pronunciations according to the dictionary for training and decoding. However, even a well-defined lexicon cannot be used to support all variations in human pronunciation. Besides, in order to cover all possible pronunciations, the size of the dictionary would be too large to implement. Sharing Gaussian densities across phonetic models and decision tree for pronunciation variation is proved to be efficient for a pronunciation variation system without dictionary modification. This paper presents the alternative methods that can be used even in the sparse database situation. Re-label training is modified to have rule-based pronunciation variation in order to obtain real phonetic acoustic models. Phonemic acoustic models are then retrained from the tying HMM states across phonetic models. These new phonemic models allow an alternative search path during recognition. The system shows better performance in the experiment.
Keywords
Gaussian distribution; decision trees; hidden Markov models; search problems; speech processing; speech recognition; Gaussian densities; acoustic models; alternative search path; decision tree; decoding; performance; phonemic models; phonetic models; pronunciation variation speech recognition; re-label training; sparse database; training; tying HMM states; Databases; Decision trees; Decoding; Dictionaries; Hidden Markov models; Loudspeakers; Research and development; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198893
Filename
1198893
Link To Document