  • DocumentCode
    2177499
  • Title
    Speech enhancement based on log spectral envelope model and harmonicity-derived spectral mask, and its coupling with feature compensation
  • Author
    Yoshioka, Takuya; Nakatani, Tomohiro
  • Author_Institution
    Commun. Sci. Labs., NTT Corp., Seika, Japan
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5064
  • Lastpage
    5067
  • Abstract
    The use of a speech spectral envelope model defined in the log spectrum-type domain is a common approach to feature enhancement for noise-robust speech recognition. However, from the noise reduction viewpoint, this approach ignores the non-peak components of a spectrum and thus suffers from poor SNR improvement during voiced periods. This paper proposes a speech enhancement method that exploits both a log spectral envelope model and a harmonic structure. The key to the method is its use of the harmonic structure to define the prior distribution of a spectral mask, which is used for both accurate noise estimation and noise attenuation. In addition, we combine log mel-frequency feature enhancement with the above method to take advantage of its low dimensionality. The complete proposed method outperforms a state-of-the-art speech enhancement method in four different noise environments.
  • Keywords
    speech enhancement; feature compensation; harmonicity-derived spectral mask; log mel-frequency feature enhancement; log spectral envelope model; log spectrum-type domain; noise attenuation; noise estimation; speech spectral envelope model; Indexes; Nickel; Noise measurement; harmonic structure; log mel-frequency spectrum; log spectrum; spectral mask
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2011.5947495
  • Filename
    5947495