DocumentCode
3783763
Title
Innovative approaches for large vocabulary name recognition
Author
Yuqing Gao;B. Ramabhadran;J. Chen;H. Erdogan;M. Picheny
Author_Institution
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
1
fYear
2001
fDate
6/23/1905 12:00:00 AM
Firstpage
53
Abstract
Automatic name dialing is a practical and interesting application of speech recognition on telephony systems. The IBM name recognition system is a large vocabulary, speaker independent system currently in use for reaching IBM employees in the United States. We present some innovative algorithms that improve name recognition accuracy. Unlike transcription tasks, such as the Switchboard task, recognition of names poses a variety of different problems. Several of these problems arise from the fact that foreign names are hard to pronounce for speakers who are not familiar with the names and that there are no standardized methods for pronouncing proper names. Noise robustness is another very important factor as these calls are typically made in noisy environments, such as from a car, cafeteria, airport, etc. and over different kinds of cellular and land-line telephone channels. We have performed a systematic analysis of the speech recognition errors and tackled the issues separately with techniques ranging from weighted speaker clustering, massive adaptation, rapid and unsupervised adaptation methods to pronunciation modeling methods. We find that the decoding accuracy can be improved significantly (28% relative) in this manner.
Keywords
"Vocabulary","Speech recognition","Telephony","Working environment noise","Hidden Markov models","Speech analysis","Clustering algorithms","Loudspeakers","Speech coding","Testing"
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP ´01). 2001 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940765
Filename
940765
Link To Document