DocumentCode
302325
Title
Explicit modeling of coarticulation in a statistical speech recognizer
Author
Chen, Ruxin ; Jamieson, Leah H.
Author_Institution
Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
Volume
1
fYear
1996
fDate
7-10 May 1996
Firstpage
463
Abstract
This paper presents a new statistical speech model in which coarticulation is modeled explicitly. Unlike HMMs, in which the current state depends only on the previous state and the current observation, the proposed model supports dependence on the previous and next states and on the previous and current observations. The degree of coarticulation between adjacent phones is modeled parametrically, and can be adjusted according to a parameter representing the speaking rate. The model also incorporates a parameter that represents a frame-by-frame measure of confidence in the speech. We present two methods for solving the system parameters: one based on the K-means method, and a novel method based on explicitly minimizing a measure of the segmentation error. A new, efficient forward algorithm and the use of top candidates in the search greatly reduce the computational complexity. In evaluation on the TIMIT data base, we achieve a phone recognition rate of 77.1%
Keywords
computational complexity; parameter estimation; speech processing; speech recognition; statistical analysis; K-means method; TIMIT data base; adjacent phones; coarticulation; computational complexity reduction; explicit modeling; forward algorithm; frame by frame measure; phone recognition rate; segmentation error; speaking rate; speech confidence; statistical speech model; statistical speech recognizer; system parameters; Computational complexity; Context modeling; Density functional theory; Hidden Markov models; Interpolation; Laser sintering; Speech recognition; Terminology; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.541133
Filename
541133
Link To Document