DocumentCode :
2695035
Title :
Speaker-independent recognition of spoken English letters
Author :
Cole, Ronald ; Fanty, Mark ; Muthusamy, Yeshwant ; Gopalakrishnan, Murali
fYear :
1990
fDate :
17-21 June 1990
Firstpage :
45
Abstract :
A description is presented of EAR, an English alphabet recognizer that performs speaker-independent recognition of letters spoken in isolation. During recognition, (a) signal processing routines transform the digitized speech into useful representations, (b) rules are applied to the representations to locate segment boundaries, (c) feature measurements are computed on the speech segments, and (d) a neural network uses the feature measurements to classify the letter. The system was trained on one token of each letter from 120 speakers. Performance was 95% when tested on a new set of 30 speakers. Performance was 96% when tested on a second token of each letter from the original 120 speakers. The recognition accuracy is 6% higher than that of previously reported systems. The high level of performance is attributed to accurate and explicit phonetic segmentation, the use of speech knowledge to select features that measure the important linguistic information, and the ability of the neural classifier to model the variability of the data
Keywords :
knowledge representation; neural nets; speech recognition; English alphabet recognizer; data variability modelling; digitized speech; error rate; feature measurements; letter tokens; linguistic information; multispeaker recognition; neural classifier; neural network; performance; phonetic segmentation; recognition accuracy; representations; segment boundaries; signal processing routines; speaker-independent recognition; speech knowledge; spoken English letters;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 1990., 1990 IJCNN International Joint Conference on
Conference_Location :
San Diego, CA, USA
Type :
conf
DOI :
10.1109/IJCNN.1990.137693
Filename :
5726652
Link To Document :
بازگشت