DocumentCode :
1020834
Title :
A projection-based likelihood measure for speech recognition in noise
Author :
Carlson, Beth A. ; Clements, Mark A.
Author_Institution :
Dept. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
2
Issue :
1
fYear :
1994
Firstpage :
97
Lastpage :
102
Abstract :
Investigates a projection-based likelihood measure that significantly improves automatic speech recognition performance in the presence of additive broadband noise. The measure was developed by modifying likelihood scores in continuous Gaussian density hidden Markov models (HMMs), resulting in the weighted projection measure (WPM). Experimental results using the proposed measure are reported for several performance factors: different cepstral-based parameters, normal and multistyle speech, and various noise signals, including white, jittering white, and broadband colored noise. In all cases, significant improvements in speaker-dependent, isolated word recognition were achieved using the WPM instead of the standard Gaussian likelihood measure (weighted Euclidean distance (WED)). As an example, at a SNR of 5 dB, the WPM resulted in improvement in recognition accuracy from 19.4 to 80.6% compared with the standard WED for the DFT mel-cepstral representation.
Keywords :
acoustic noise; hidden Markov models; interference suppression; random noise; speech analysis and processing; speech recognition; WED; additive broadband noise; automatic speech recognition performance; broadband colored noise; cepstral-based parameters; continuous Gaussian density hidden Markov models; jittering white noise; multistyle speech; noise signals; normal speech; projection-based likelihood measure; speaker-dependent isolated word recognition; speech recognition; weighted Euclidean distance; weighted projection measure; white noise; Additive noise; Automatic speech recognition; Colored noise; Density measurement; Hidden Markov models; Noise measurement; Speech enhancement; Speech recognition; Weight measurement; White noise;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.260341
Filename :
260341
Link To Document :
بازگشت