DocumentCode :
2704844
Title :
Combination of Recognizers and Fusion of Features Approach to Missing Data ASR Under Non-Stationary Noise Conditions
Author :
Joshi, Neil ; Guan, Ling
Author_Institution :
Dept. of Electr. & Comput. Eng., Ryerson Univ., Toronto, Ont.
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
The difficulty of ASR under non-stationary noise conditions is a major contributing factor hindering the widespread deployment of ASR systems. Bottom up techniques such as speech noise separation and top down methods to adapt the acoustic model to the environment have been applied to address the issue. The missing data approach to ASR improves upon existing techniques basing recognition solely on the reliable components of the signal and has been demonstrated as an effective method to handle non-stationarity. Proposed in this paper is a novel technique whereby ASR using missing data theory under non-stationary noise conditions is improved by use of a fusion of models at the decision level. This fused model introduces more resilient features to the missing data decode process. The fused decoder is found to significantly increase recognition performance over conventional missing data techniques. A major finding in this paper is when the fused decoder exhibits the fusion of bottom up and top down processes. Under this condition, the proposed combination of recognizers technique is found to outperform all other tested ASR systems.
Keywords :
decoding; feature extraction; speech coding; speech recognition; acoustic model; features fusion; features recognizers; missing data ASR; missing data decode process; non-stationary noise conditions; speech noise separation; Acoustic noise; Automatic speech recognition; Decoding; Hidden Markov models; Mel frequency cepstral coefficient; Speech enhancement; Speech processing; Speech recognition; System testing; Working environment noise; Hidden Markov models; Pattern recognition; Speech processing; Speech recognition; Time Series;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2007.367251
Filename :
4218282
Link To Document :
بازگشت