• DocumentCode
    49106
  • Title

    Noise Model Transfer: Novel Approach to Robustness Against Nonstationary Noise

  • Author

    Yoshioka, Takashi ; Nakatani, Takeshi

  • Author_Institution
    NTT Commun. Sci. Labs., Nippon Telegraph & Telephone Corp., Kyoto, Japan
  • Volume
    21
  • Issue
    10
  • fYear
    2013
  • fDate
    Oct. 2013
  • Firstpage
    2182
  • Lastpage
    2192
  • Abstract
    This paper proposes an approach, called noise model transfer (NMT), for estimating the rapidly changing parameter values of a feature-domain noise model, which can be used to enhance feature vectors corrupted by highly nonstationary noise. Unlike conventional methods, the proposed approach can exploit both observed feature vectors, representing spectral envelopes, and other signal properties that are usually discarded during feature extraction but that are useful for separating nonstationary noise from speech. Specifically, we assume the availability of a noise power spectrum estimator that can capture rapid changes in noise characteristics by leveraging such signal properties. NMT determines the optimal transformation from the estimated noise power spectra into the feature-domain noise model parameter values in the sense of maximum likelihood. NMT is successfully applied to meeting speech recognition, where the main noise sources are competing talkers; and reverberant speech recognition, where the late reverberation is regarded as highly nonstationary additive noise.
  • Keywords
    feature extraction; maximum likelihood estimation; speech recognition; NMT; enhance feature vectors; feature extraction; feature-domain noise model; feature-domain noise model parameter values; maximum likelihood; noise model transfer; noise power spectrum estimator; noise sources; nonstationary additive noise; nonstationary noise; optimal transformation; parameter values; reverberant speech recognition; speech recognition; Additive noise; Feature extraction; Microphones; Speech; Speech recognition; Vectors; Meeting speech recognition; nonstationary noise; reverberation; robust speech recognition;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2272513
  • Filename
    6563155