DocumentCode :
1439163
Title :
A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition
Author :
Afify, Mohamed ; Gong, Yifan ; Haton, Jean-Paul
Author_Institution :
LORIA, Univ. Henri Poincare, Vandoeuvre, France
Volume :
6
Issue :
6
fYear :
1998
fDate :
11/1/1998 12:00:00 AM
Firstpage :
524
Lastpage :
538
Abstract :
A unified approach to the acoustic mismatch problem is proposed. A maximum likelihood state-based additive bias compensation algorithm is developed for the continuous density hidden Markov model (CDHMM). Based on this technique, specific bias models in the mel cepstral and the linear spectral domains are presented. Among these models, a new polynomial trend bias model in the mel cepstral domain is derived, which proved effective for Lombard speech compensation. In addition, a joint estimation algorithm for additive and convolutive bias compensation is proposed. This algorithm is based on applying the expectation maximization (EM) technique in both above-mentioned domains, in conjunction with a parallel model combination (PMC) based transformation. The compensation of the dynamic (difference) coefficients in the proposed framework is also studied. The evaluation data base consists of a 21 confusable word vocabulary uttered by 24 speakers. Three mismatched versions of the data base are considered, i.e., Lombard speech, 15 dB noisy Lombard speech, and 5 dB noisy Lombard speech. The proposed techniques result in 50.9%, 74.6%, and 67.3% reduction in the performance difference between matched and uncompensated word error rates for the three mismatch conditions, respectively. When dynamic coefficients are considered the corresponding reductions are 46.8%, 72.4%, and 70.9%
Keywords :
cepstral analysis; convolution; hidden Markov models; maximum likelihood estimation; noise; speech recognition; 15 dB noisy Lombard speech; 5 dB noisy Lombard speech; CDHMM; Lombard speech compensation; acoustic mismatch problem; bias models; confusable word vocabulary; continuous density hidden Markov model; convolutive bias compensation; dynamic difference coefficients; evaluation data base; expectation maximization; general joint additive bias compensation; joint estimation algorithm; linear spectral domain; matched word error rate; maximum likelihood state-based bias compensation; mel cepstral domain; mismatch conditions; noisy Lombard speech recognition; parallel model combination based transformation; polynomial trend bias model; uncompensated word error rate; unified approach; Acoustic noise; Additive noise; Additive white noise; Cepstral analysis; Communication channels; Hidden Markov models; Maximum likelihood estimation; Speech enhancement; Speech recognition; Testing;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.725319
Filename :
725319
Link To Document :
بازگشت