A speaker independent word recognition system based on phoneme recognition for a large size (212 words) vocabulary

Author

Makino, Shozo; Kido, Ken'iti

Author_Institution

Tohoku University, Sendai, Japan

Volume

9

fYear

1984

fDate

March 1984

Firstpage

29

Lastpage

32

Abstract

This paper describes a speaker-independent spoken word recognition system for a large vocabulary. Speech is analyzed by a filter bank, from whose logarithmic spectrum 11 features are extracted every 10 ms. Using these features, the speech is first segmented, and primary phoneme recognition is carried out on every segment using the Bayes decision method. After errors in segmentation and phoneme recognition are corrected, secondary recognition is carried out for some of the consonants and the phonemic sequence is determined. The word dictionary item with the maximum likelihood given this sequence is chosen as the recognition output. On the training samples, 212 words uttered by 10 male and 10 female speakers, the system scores 75.9% for phoneme recognition and 92.4% for word recognition. For the same words uttered by 30 male and 20 female speakers who were not among the training speakers, the word recognition score is 88.1%.
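
The two decision steps named in the abstract, Bayes decision per segment and maximum-likelihood dictionary lookup, can be illustrated with a minimal sketch. The abstract does not specify the class-conditional densities, the priors, or how word likelihood is computed, so everything below is an assumption made for illustration: diagonal-covariance Gaussian phoneme models, a hypothetical phoneme confusion table, toy 2-dimensional features in place of the 11 real ones, and a dictionary match restricted to sequences of equal length. It is not the authors' implementation.

```python
import math

def log_gaussian(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def bayes_decide(x, models, priors):
    """Bayes decision: pick the phoneme maximizing log prior + log likelihood."""
    return max(
        models,
        key=lambda ph: math.log(priors[ph]) + log_gaussian(x, *models[ph]),
    )

def best_word(phonemes, dictionary, confusion, floor=1e-6):
    """Pick the dictionary word most likely to yield the recognized sequence.

    confusion[(true, recognized)] is an assumed P(recognized | true).
    Simplification: only same-length sequences are scored (no insertions
    or deletions), and unseen pairs get a small floor probability.
    """
    def score(reference):
        if len(reference) != len(phonemes):
            return float("-inf")
        return sum(
            math.log(confusion.get((r, p), floor))
            for r, p in zip(reference, phonemes)
        )
    return max(dictionary, key=lambda word: score(dictionary[word]))

# Toy data, all hypothetical: (mean, variance) per phoneme, equal priors.
MODELS = {"a": ([0.0, 0.0], [1.0, 1.0]), "i": ([3.0, 3.0], [1.0, 1.0])}
PRIORS = {"a": 0.5, "i": 0.5}
DICTIONARY = {"aka": ["a", "k", "a"], "iki": ["i", "k", "i"]}
CONFUSION = {
    ("a", "a"): 0.9, ("a", "i"): 0.1,
    ("i", "i"): 0.9, ("i", "a"): 0.1,
    ("k", "k"): 0.8, ("k", "t"): 0.2,
}

if __name__ == "__main__":
    print(bayes_decide([0.2, -0.1], MODELS, PRIORS))
    print(best_word(["a", "t", "a"], DICTIONARY, CONFUSION))
```

Because the dictionary is scored through a confusion table rather than by exact match, a word can still be recovered when one of its consonants is misrecognized, which is consistent with the word score in the abstract being higher than the phoneme score.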

Keywords

Data mining; Dictionaries; Error correction; Feature extraction; Filter bank; Frequency; Pattern recognition; Speech analysis; Speech recognition; Vocabulary

fLanguage

English

Publisher

IEEE

Conference_Title

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '84)

Type

conf

DOI

10.1109/ICASSP.1984.1172568

Filename

1172568