DocumentCode :
2516140
Title :
Speech Magnitude-Spectrum Information-Entropy (MSIE) for Automatic Speech Recognition in Noisy Environments
Author :
Nolazco-Flores, J.A. ; Aceves L, Roberto A ; García-Perera, L. Paola
Author_Institution :
Comput. Sci. Dept., Inst. Tecnol. y de Estudios Superiores de Monterrey, Monterrey, Mexico
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
4364
Lastpage :
4367
Abstract :
The Magnitude-Spectrum Information-Entropy (MSIE) of the speech signal is presented as an alternative representation of the speech that can be used to mitigate the mismatch between training and testing conditions. The speech-magnitude spectrum is considered as a random variable from which entropy coefficients can be calculated for each frame. By concatenating these entropic coefficients to its corresponding MFCC vector, then calculating the dynamic coefficients, Δ and ΔΔ, the results show an improvement compared to a baseline. The MSIE effectiveness was tested under the Aurora 2 database audio files. When trained in clean speech, the experimental results obtained by the MSIE concatenated to the MFCC outperform the results obtained with the MFCC baseline system for selected types of noises at different SNRs. For this selected group of noises the overall improvement performance in the range 0 dB to 20 dB for the Aurora 2 database is of 15.06%.
Keywords :
entropy; speech recognition; SNR; automatic speech recognition; baseline system; database audio files; dynamic coefficient; entropy coefficient; speech magnitude spectrum information entropy; Entropy; Signal to noise ratio; Speech; Speech recognition; Testing; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.1061
Filename :
5597871
Link To Document :
بازگشت