DocumentCode :
178056
Title :
Feature enhancement using sparse reference and estimated soft-mask exemplar-pairs for noisy speech recognition
Author :
Lee Ngee Tan ; Alwan, Abeer
Author_Institution :
Dept. of Electr. Eng., Univ. of California, Los Angeles, Los Angeles, CA, USA
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
1710
Lastpage :
1714
Abstract :
A feature enhancement technique for noise-robust speech recognition is proposed. Existing sparse exemplar-based feature enhancement methods use clean speech and pure noise Mel-spectral exemplars, or clean and noisy speech log-Mel-spectral exemplar-pairs, in their dictionaries. In contrast, the proposed technique constructs its dictionaries using reference soft-mask (SMref) and estimated soft-mask (SMest) exemplar-pairs derived from the training data. The sparse linear combination of SMest dictionary exemplars that best represents the test utterance´s SMest is obtained by solving an L1-minimization problem. This sparse linear combination is applied to the SMref exemplar dictionary to generate an enhanced soft-mask for denoising the utterance´s Mel-spectra before MFCC extraction. On the Aurora-2 noisy speech recognition task, the proposed algorithm outperforms other sparse Mel-spectral exemplar-based feature enhancement schemes when mismatch exists between the dictionary exemplars and the test set. A preliminary experiment on Aurora-4 shows similar trends.
Keywords :
minimisation; signal denoising; speech enhancement; speech recognition; Aurora-2 noisy speech recognition; L1-minimization problem; MFCC extraction; SMest dictionary exemplars; feature enhancement; noise robust speech recognition; noisy speech log-Mel-spectral exemplar-pairs; signal denoising; soft-mask exemplar-pairs; sparse linear combination; Dictionaries; Feature extraction; Noise; Noise measurement; Speech; Speech recognition; Training; Feature enhancement; joint dictionary; noisy speech recognition; soft mask estimation; sparse exemplar;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6853890
Filename :
6853890
Link To Document :
بازگشت