Title :
Explicit modeling of coarticulation in a statistical speech recognizer
Author :
Chen, Ruxin ; Jamieson, Leah H.
Author_Institution :
Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
Abstract :
This paper presents a new statistical speech model in which coarticulation is modeled explicitly. Unlike HMMs, in which the current state depends only on the previous state and the current observation, the proposed model supports dependence on the previous and next states and on the previous and current observations. The degree of coarticulation between adjacent phones is modeled parametrically, and can be adjusted according to a parameter representing the speaking rate. The model also incorporates a parameter that represents a frame-by-frame measure of confidence in the speech. We present two methods for solving the system parameters: one based on the K-means method, and a novel method based on explicitly minimizing a measure of the segmentation error. A new, efficient forward algorithm and the use of top candidates in the search greatly reduce the computational complexity. In evaluation on the TIMIT data base, we achieve a phone recognition rate of 77.1%
Keywords :
computational complexity; parameter estimation; speech processing; speech recognition; statistical analysis; K-means method; TIMIT data base; adjacent phones; coarticulation; computational complexity reduction; explicit modeling; forward algorithm; frame by frame measure; phone recognition rate; segmentation error; speaking rate; speech confidence; statistical speech model; statistical speech recognizer; system parameters; Computational complexity; Context modeling; Density functional theory; Hidden Markov models; Interpolation; Laser sintering; Speech recognition; Terminology; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.541133