مرکز منطقه ای اطلاع رساني علوم و فناوري - Vector equalization in hidden Markov models for noisy speech recognition

DocumentCode :

284643

Title :

Vector equalization in hidden Markov models for noisy speech recognition

Author :

Juang, B.H. ; Paliwal, K.K.

Author_Institution :

AT&T Bell Labs., Murray Hill, NJ, USA

Volume :

fYear :

1992

fDate :

23-26 Mar 1992

Firstpage :

301

Abstract :

Speech recognizers often experience serious performance degradation when deployed in an unknown acoustic (particularly, noise contaminated) environment. To combat this problem, the authors proposed in a previous study a distortion measure that takes into account the norm shrinkage bias in the noisy cepstrum. The authors incorporate a first-order equalization mechanism, specifically aimed at avoiding the norm shrinkage problem, in a hidden Markov model (HMM) framework to model the speech cepstral sequence. Such a modeling technique requires special care as the formulation inevitably involves parameter estimation from a set of data with singular dispersion. The authors provide solutions to this HMM stochastic modeling problem and give algorithms for estimating the necessary model parameters. They experimentally show that incorporation of the first-order normal equalization model makes the HMM-based speech recognizer robust to noise. With respect to a conventional HMM recognizer, this leads to an improvement in recognition performance which is equivalent to about 15-20-dB gain in signal-to-noise ratio

Keywords :

acoustic noise; hidden Markov models; speech recognition; 15 to 20 dB; HMM; acoustic environment; algorithms; distortion measure; first-order equalization; hidden Markov model; noisy cepstrum; noisy speech recognition; norm shrinkage bias; parameter estimation; recognition performance; signal-to-noise ratio; singular dispersion; speech cepstral sequence; speech recognizer; stochastic modeling; vector equalisation; Acoustic distortion; Acoustic measurements; Acoustic noise; Degradation; Distortion measurement; Hidden Markov models; Pollution measurement; Speech enhancement; Speech recognition; Working environment noise;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on

Conference_Location :

San Francisco, CA

ISSN :

1520-6149

Print_ISBN :

0-7803-0532-9

Type :

conf

DOI :

10.1109/ICASSP.1992.225912

Filename :

225912

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=284643