• DocumentCode
    178056
  • Title

    Feature enhancement using sparse reference and estimated soft-mask exemplar-pairs for noisy speech recognition

  • Author

    Lee Ngee Tan ; Alwan, Abeer

  • Author_Institution
    Dept. of Electr. Eng., Univ. of California, Los Angeles, Los Angeles, CA, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    1710
  • Lastpage
    1714
  • Abstract
    A feature enhancement technique for noise-robust speech recognition is proposed. Existing sparse exemplar-based feature enhancement methods use clean speech and pure noise Mel-spectral exemplars, or clean and noisy speech log-Mel-spectral exemplar-pairs, in their dictionaries. In contrast, the proposed technique constructs its dictionaries using reference soft-mask (SMref) and estimated soft-mask (SMest) exemplar-pairs derived from the training data. The sparse linear combination of SMest dictionary exemplars that best represents the test utterance´s SMest is obtained by solving an L1-minimization problem. This sparse linear combination is applied to the SMref exemplar dictionary to generate an enhanced soft-mask for denoising the utterance´s Mel-spectra before MFCC extraction. On the Aurora-2 noisy speech recognition task, the proposed algorithm outperforms other sparse Mel-spectral exemplar-based feature enhancement schemes when mismatch exists between the dictionary exemplars and the test set. A preliminary experiment on Aurora-4 shows similar trends.
  • Keywords
    minimisation; signal denoising; speech enhancement; speech recognition; Aurora-2 noisy speech recognition; L1-minimization problem; MFCC extraction; SMest dictionary exemplars; feature enhancement; noise robust speech recognition; noisy speech log-Mel-spectral exemplar-pairs; signal denoising; soft-mask exemplar-pairs; sparse linear combination; Dictionaries; Feature extraction; Noise; Noise measurement; Speech; Speech recognition; Training; Feature enhancement; joint dictionary; noisy speech recognition; soft mask estimation; sparse exemplar;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853890
  • Filename
    6853890