مرکز منطقه ای اطلاع رساني علوم و فناوري - Explicit modeling of coarticulation in a statistical speech recognizer

DocumentCode :

302325

Title :

Explicit modeling of coarticulation in a statistical speech recognizer

Author :

Chen, Ruxin ; Jamieson, Leah H.

Author_Institution :

Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA

Volume :

fYear :

1996

fDate :

7-10 May 1996

Firstpage :

463

Abstract :

This paper presents a new statistical speech model in which coarticulation is modeled explicitly. Unlike HMMs, in which the current state depends only on the previous state and the current observation, the proposed model supports dependence on the previous and next states and on the previous and current observations. The degree of coarticulation between adjacent phones is modeled parametrically, and can be adjusted according to a parameter representing the speaking rate. The model also incorporates a parameter that represents a frame-by-frame measure of confidence in the speech. We present two methods for solving the system parameters: one based on the K-means method, and a novel method based on explicitly minimizing a measure of the segmentation error. A new, efficient forward algorithm and the use of top candidates in the search greatly reduce the computational complexity. In evaluation on the TIMIT data base, we achieve a phone recognition rate of 77.1%

Keywords :

computational complexity; parameter estimation; speech processing; speech recognition; statistical analysis; K-means method; TIMIT data base; adjacent phones; coarticulation; computational complexity reduction; explicit modeling; forward algorithm; frame by frame measure; phone recognition rate; segmentation error; speaking rate; speech confidence; statistical speech model; statistical speech recognizer; system parameters; Computational complexity; Context modeling; Density functional theory; Hidden Markov models; Interpolation; Laser sintering; Speech recognition; Terminology; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on

Conference_Location :

Atlanta, GA

ISSN :

1520-6149

Print_ISBN :

0-7803-3192-3

Type :

conf

DOI :

10.1109/ICASSP.1996.541133

Filename :

541133

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=302325