DocumentCode :
3372709
Title :
Robust speech recognition using feature-domain multi-channel bayesian estimators
Author :
Principi, Emanuele ; Rotili, Rudy ; Cifani, Simone ; Marinelli, Lorenzo ; Squartini, Stefano ; Piazza, Francesco
Author_Institution :
3MediaLabs, Univ. Politec. delle Marche, Ancona, Italy
fYear :
2010
fDate :
May 30 2010-June 2 2010
Firstpage :
2670
Lastpage :
2673
Abstract :
This paper proposes innovative multi-channel bayesian estimators in the feature-domain for robust speech recognition. Both minimum-mean-squared-error (MMSE) and maximum-a-posteriori (MAP) criteria have been explored: the related algorithms extend the multi-channel frequency-domain counterparts and generalize the single-channel feature-domain MMSE solution, recently appeared in the literature. Computer simulations conducted on a modified AURORA2 database show the efficacy of the frequency-domain multi-channel estimators when used as a pre-processing stage of a speech recognition engine, and that the proposed multi-channel MAP approach outperforms single-channel estimators by at least 3% on average.
Keywords :
Bayes methods; channel estimation; feature extraction; frequency-domain analysis; least mean squares methods; maximum likelihood estimation; speech recognition; MMSE; feature-domain analysis; frequency-domain analysis; maximum-a-posteriori algorithm; minimum mean square error; multichannel Bayesian estimator; speech recognition; Automatic speech recognition; Bayesian methods; Computer simulation; Engines; Frequency estimation; Microphones; Robustness; Spatial databases; Speech enhancement; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems (ISCAS), Proceedings of 2010 IEEE International Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-5308-5
Electronic_ISBN :
978-1-4244-5309-2
Type :
conf
DOI :
10.1109/ISCAS.2010.5537057
Filename :
5537057
Link To Document :
بازگشت