Title :
Classification of coarse phonetic categories in continuous speech: statistical classifiers vs. temporal flow connectionist network
Author :
Aktas, A. ; Schmidbauer, O. ; Maier, K.H. ; Feix, W.H.
Author_Institution :
Siemens AG, Munchen, West Germany
Abstract :
A comparison of the temporal flow model (TFM) as a connectionist approach with statistical methods like the hidden Markov model (HMM) and the maximum-likelihood (ML) classifier on the basis of frame and segment recognition experiments is presented. All three methods were applied to a coarse phonetic classification task in a speaker-dependent mode. The seven coarse phonetic categories (CPCs) used correspond to the categories of manner of articulation. The experiments were performed on manually labeled continuous-speech data incorporating two versions of 50 phonetically balanced sentences. A short time cepstral representation of the speech data was chosen as the basis for all classification experiments. The best results were achieved with a context-dependent HMM. Experiments without the use of segment context noticeably yield better overall results for the TFM. Both are found to be superior to the ML classifier
Keywords :
speech analysis and processing; speech recognition; German language; classification experiments; coarse phonetic classification; context-dependent HMM; continuous speech; hidden Markov model; manually labeled continuous-speech data; maximum likelihood classifier; phonetically balanced sentences; short time cepstral representation; speaker-dependent mode; statistical classifiers; temporal flow connectionist network; temporal flow model; Cepstral analysis; Frequency; Hidden Markov models; Probability; Research and development; Signal analysis; Speech; Speech analysis; Speech recognition; Statistical analysis; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115544