Title :
City name recognition over the telephone
Author :
Fanty, Mark ; Schmid, Philipp ; Cole, Ronald
Author_Institution :
Oregon Grad. Inst. of Sci. & Technol., Beaverton, OR, USA
Abstract :
The authors present a neural-network-based speech recognition system for telephone speech. A neural network classifier provides phoneme probabilities for each frame of the utterance. A dynamic programming algorithm finds the most probable sequence of words. The classifier was trained on a spoken name corpus which contained the test vocabulary and many other words. The test set consisted of 262 utterances containing 44 cities and 2 states. The best result obtained on the test set was 92.9% word accuracy (90.1% on just the city names). Removing phoneme duration constraints reduced recognition accuracy to 82%. Performance fell to 82.4% using a network trained on a large vocabulary, fluent-speech corpus. Several other experiments are reported which did not produce significant changes in system performance.<>
Keywords :
dynamic programming; neural nets; speech recognition; telephony; city name recognition; dynamic programming algorithm; neural network classifier; phoneme probabilities; system performance; telephone speech; test vocabulary; word accuracy;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.1993.319177