DocumentCode :
388414
Title :
A comparison of learning techniques in speech recognition
Author :
Bradshaw, Gary L. ; Cole, Ron ; Li, Zongge
Author_Institution :
Carnegie-Mellon university, Pittsburgh, Pa.
Volume :
7
fYear :
1982
fDate :
30072
Firstpage :
554
Lastpage :
557
Abstract :
Template-based recognition systems overcome errors in the short-term matching process by comparing whole sequences of acoustic events. In many vocabularies, each word has a highly distinctive sequence. Some vocabularies have confusable words with very similar sequences, leading to poor recognition performance. Improvements in discriminability among similar words may be achieved by altering the matching algorithm, or by improving the reference template set. Both techniques are instances of multi-exemplar learning techniques which improve recognition performance through automatic evaluation of training data. This paper examines several such techniques using isolated utterances and highly ambiguous vocabularies (e.g., the "E" set; 3 B C D E G P V T Z) in a speaker-dependent recognition system. A system which combined both featural and template information led to the best performance for six out of eight speakers. Using this technique, E-set error rates improved from 37% to 10%.
Keywords :
Computer errors; Computer science; Data mining; Error analysis; Monitoring; Speech processing; Speech recognition; Training data; US Department of Defense; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
Type :
conf
DOI :
10.1109/ICASSP.1982.1171636
Filename :
1171636
Link To Document :
بازگشت