Speaker-independent recognition of spoken English letters

Author

Cole, Ronald ; Fanty, Mark ; Muthusamy, Yeshwant ; Gopalakrishnan, Murali

fYear

1990

fDate

17-21 June 1990

Firstpage

45

Abstract

A description is presented of EAR, an English alphabet recognizer that performs speaker-independent recognition of letters spoken in isolation. During recognition, (a) signal processing routines transform the digitized speech into useful representations, (b) rules are applied to the representations to locate segment boundaries, (c) feature measurements are computed on the speech segments, and (d) a neural network uses the feature measurements to classify the letter. The system was trained on one token of each letter from 120 speakers. Performance was 95% when tested on a new set of 30 speakers. Performance was 96% when tested on a second token of each letter from the original 120 speakers. The recognition accuracy is 6% higher than that of previously reported systems. The high level of performance is attributed to accurate and explicit phonetic segmentation, the use of speech knowledge to select features that measure the important linguistic information, and the ability of the neural classifier to model the variability of the data.
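
The four stages (a) to (d) named in the abstract can be illustrated with a toy pipeline. This is only a minimal sketch under stated assumptions, not the EAR system: the frame-energy representation, the energy-threshold segmentation rule, the two features (duration and mean energy), the single linear layer with softmax, and every function name, weight, and label below are hypothetical stand-ins chosen to keep the example small.

```python
import math


def frame_energies(samples, frame_len=4):
    """(a) Signal processing: mean squared amplitude per non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def find_segment(energies, threshold=0.1):
    """(b) Rule-based segmentation: span from first to last frame above threshold."""
    active = [i for i, e in enumerate(energies) if e > threshold]
    if not active:
        return None  # no speech found
    return active[0], active[-1] + 1  # half-open frame range


def measure_features(energies, segment):
    """(c) Feature measurement: segment duration (frames) and mean energy."""
    start, end = segment
    seg = energies[start:end]
    return [float(end - start), sum(seg) / len(seg)]


def classify(features, weights, biases, labels):
    """(d) Classification: one linear layer plus softmax; returns best label."""
    scores = [
        sum(w * f for w, f in zip(row, features)) + b
        for row, b in zip(weights, biases)
    ]
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    probs = [e / total for e in exps]
    best = max(range(len(labels)), key=lambda i: probs[i])
    return labels[best], probs


def recognize(samples, weights, biases, labels):
    """Run stages (a) to (d) in order on one isolated utterance."""
    energies = frame_energies(samples)
    segment = find_segment(energies)
    if segment is None:
        return None
    features = measure_features(energies, segment)
    label, _ = classify(features, weights, biases, labels)
    return label


# Hypothetical two-class setup: hand-set weights separate short from long segments.
LABELS = ["short", "long"]
WEIGHTS = [[-1.0, 0.0], [1.0, 0.0]]
BIASES = [3.0, -3.0]

utterance = [0.0] * 8 + [1.0, -1.0] * 4 + [0.0] * 8
print(recognize(utterance, WEIGHTS, BIASES, LABELS))
```

In a real recognizer each stage would be far richer (spectral representations, phonetic segmentation rules, many features, a trained network with one output per letter); the sketch only shows how the output of each stage feeds the next.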

Keywords

knowledge representation; neural nets; speech recognition; English alphabet recognizer; data variability modelling; digitized speech; error rate; feature measurements; letter tokens; linguistic information; multispeaker recognition; neural classifier; neural network; performance; phonetic segmentation; recognition accuracy; representations; segment boundaries; signal processing routines; speaker-independent recognition; speech knowledge; spoken English letters

fLanguage

English

Publisher

IEEE

Conference_Titel

1990 IJCNN International Joint Conference on Neural Networks

Conference_Location

San Diego, CA, USA

Type

conf

DOI

10.1109/IJCNN.1990.137693

Filename

5726652