Title :
A projection-based likelihood measure for speech recognition in noise
Author :
Carlson, Beth A. ; Clements, Mark A.
Author_Institution :
Dept. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
Investigates a projection-based likelihood measure that significantly improves automatic speech recognition performance in the presence of additive broadband noise. The measure was developed by modifying likelihood scores in continuous Gaussian density hidden Markov models (HMMs), resulting in the weighted projection measure (WPM). Experimental results using the proposed measure are reported for several performance factors: different cepstral-based parameters, normal and multistyle speech, and various noise signals, including white, jittering white, and broadband colored noise. In all cases, significant improvements in speaker-dependent, isolated word recognition were achieved using the WPM instead of the standard Gaussian likelihood measure, the weighted Euclidean distance (WED). For example, at an SNR of 5 dB with the DFT mel-cepstral representation, the WPM raised recognition accuracy from 19.4% (standard WED) to 80.6%.
Keywords :
acoustic noise; hidden Markov models; interference suppression; random noise; speech analysis and processing; speech recognition; WED; additive broadband noise; automatic speech recognition performance; broadband colored noise; cepstral-based parameters; continuous Gaussian density hidden Markov models; jittering white noise; multistyle speech; noise signals; normal speech; projection-based likelihood measure; speaker-dependent isolated word recognition; weighted Euclidean distance; weighted projection measure; white noise; Additive noise; Automatic speech recognition; Colored noise; Density measurement; Hidden Markov models; Noise measurement; Speech enhancement; Speech recognition; Weight measurement; White noise
Journal_Title :
Speech and Audio Processing, IEEE Transactions on