DocumentCode :
1096892
Title :
A parametric representation and a clustering method for phoneme recognition--Application to stops in a CV environment
Author :
Tanaka, Kazuyo
Author_Institution :
Ministry of International Trade and Industry, Ibaraki, Japan.
Volume :
29
Issue :
6
fYear :
1981
fDate :
12/1/1981
Firstpage :
1117
Lastpage :
1127
Abstract :
A new method of representing phonemic categories and determining their standard values from a training sample distribution is presented. It is an essential part of a phoneme recognition system aiming at speaker-independent speech recognition. The phonemic value of a short-duration speech signal of up to 50 ms is represented by a matrix composed of acoustic parameters. Standard phonemic categories (SPC's) are defined by a combination of several simple potential functions in this matrix space. The potential function set, as well as its number, is determined automatically by the proposed method. Processing is primarily by algebraic operation and is formulated according to an analogy to particle dynamics. The method is applied to voiceless and voiced stop consonant sets spoken by twelve speakers. The relationship between the classification rate and the number of SPC's is investigated under several initial conditions. Stop consonant recognition tests in CV-syllables are made using derived SPC sets irrespective of following vowels. Recognition rates for the utterances of four speakers not included among the twelve speakers used for training were 84 percent for voiceless and 81 percent for voiced stops.
Keywords :
Clustering methods; Feature extraction; Helium; Instruments; Isolation technology; Loudspeakers; Speech recognition; Standards development; Testing; Vocabulary;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1981.1163693
Filename :
1163693