مرکز منطقه ای اطلاع رساني علوم و فناوري - Feature enhancement using sparse reference and estimated soft-mask exemplar-pairs for noisy speech recognition

DocumentCode :

178056

Title :

Feature enhancement using sparse reference and estimated soft-mask exemplar-pairs for noisy speech recognition

Author :

Lee Ngee Tan ; Alwan, Abeer

Author_Institution :

Dept. of Electr. Eng., Univ. of California, Los Angeles, Los Angeles, CA, USA

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

1710

Lastpage :

1714

Abstract :

A feature enhancement technique for noise-robust speech recognition is proposed. Existing sparse exemplar-based feature enhancement methods use clean speech and pure noise Mel-spectral exemplars, or clean and noisy speech log-Mel-spectral exemplar-pairs, in their dictionaries. In contrast, the proposed technique constructs its dictionaries using reference soft-mask (SM_ref) and estimated soft-mask (SM_est) exemplar-pairs derived from the training data. The sparse linear combination of SM_est dictionary exemplars that best represents the test utterance´s SM_est is obtained by solving an L1-minimization problem. This sparse linear combination is applied to the SM_ref exemplar dictionary to generate an enhanced soft-mask for denoising the utterance´s Mel-spectra before MFCC extraction. On the Aurora-2 noisy speech recognition task, the proposed algorithm outperforms other sparse Mel-spectral exemplar-based feature enhancement schemes when mismatch exists between the dictionary exemplars and the test set. A preliminary experiment on Aurora-4 shows similar trends.

Keywords :

minimisation; signal denoising; speech enhancement; speech recognition; Aurora-2 noisy speech recognition; L1-minimization problem; MFCC extraction; SMest dictionary exemplars; feature enhancement; noise robust speech recognition; noisy speech log-Mel-spectral exemplar-pairs; signal denoising; soft-mask exemplar-pairs; sparse linear combination; Dictionaries; Feature extraction; Noise; Noise measurement; Speech; Speech recognition; Training; Feature enhancement; joint dictionary; noisy speech recognition; soft mask estimation; sparse exemplar;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6853890

Filename :

6853890

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=178056