DocumentCode
310564
Title
Direct identification vs. correlated models to process acoustic and articulatory informations in automatic speech recognition
Author
André-Obrecht, Regine ; Jacob, Bruno
Author_Institution
IRIT, Univ. Paul Sabatier, Toulouse, France
Volume
2
fYear
1997
fDate
21-24 Apr 1997
Firstpage
999
Abstract
Our work deals with the classical problem of merging heterogenous and asynchronous parameters. It is well known that lip reading improves the speech recognition score, especially in noise conditions; so we study more precisely the modeling of the acoustic and labial parameters to propose two automatic speech recognition systems: a direct identification is performed by using a classical HMM approach, no correlation between visual and acoustic parameters is assumed; and two correlated models, a master HMM and a slave HMM, process respectively the labial observations and the acoustic ones. To assess each approach, we use a segmental pre-processing method. Our task is the recognition of spelled French letters, in clear and noisy (cocktail party) environments. Whatever the approach and conditions, the introduction of labial features improves the performance, but the difference between the two models is not enough sufficient to provide any priority
Keywords
acoustic signal processing; correlation methods; feature extraction; hidden Markov models; image processing; natural languages; noise; speech processing; speech recognition; HMM; acoustic information processing; acoustic parameters modeling; articulatory information processing; asynchronous parameters; automatic speech recognition; clear environment; cocktail party environment; correlated models; direct identification; heterogenous parameters; labial features; labial parameters modeling; linguistic decoder; lip reading; master HMM; noise conditions; noisy environment; segmental preprocessing method; slave HMM; speech recognition score; spelled French letters; visual parameters; Acoustic noise; Automatic speech recognition; Decoding; Hidden Markov models; Jacobian matrices; Lips; Master-slave; Optical noise; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.596108
Filename
596108
Link To Document