DocumentCode
3102230
Title
System combination for improved automatic generation of N-best proper nouns pronunciation
Author
Duncan, Richard
Author_Institution
Mississippi State Univ., MS, USA
fYear
2001
fDate
2001
Firstpage
208
Lastpage
212
Abstract
Proper nouns present a challenging problem for current speech recognition technology since they often do not follow typical letter-to-sound conversion rules. Several different automated methods, Boltzmann machines, decision trees, and recurrent neural networks have been attempted in the literature, yet no single system has achieved an acceptable error rate. Since the project goal is the generation of pronunciation dictionaries for speech recognition, however, we can easily combine the multiple outputs of the multiple systems and use the total database coverage as our scoring metric. For generating at least one correct pronunciation for all names, combining all systems gives us a 19.6% error rate, a 23.1% absolute reduction over the best previous system. For generating every pronunciation in the database the combined system rates at 29.1%, a 23.6% reduction
Keywords
Boltzmann machines; decision trees; error statistics; recurrent neural nets; speech recognition; Boltzmann machines; N-best proper nouns pronunciation; automated methods; automatic generation; decision trees; error rate; letter-to-sound conversion rules; multiple outputs; multiple systems; pronunciation; pronunciation dictionaries; recurrent neural networks; scoring metric; speech recognition; system combination; Databases; Decision trees; Dictionaries; Electronic mail; Error analysis; Recurrent neural networks; Robustness; Speech recognition; USA Councils; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
SoutheastCon 2001. Proceedings. IEEE
Conference_Location
Clemson, SC
Print_ISBN
0-7803-6748-0
Type
conf
DOI
10.1109/SECON.2001.923117
Filename
923117
Link To Document