• DocumentCode
    667537
  • Title

    Speech enhancement by sparse, low-rank, and dictionary spectrogram decomposition

  • Author

    Zhuo Chen ; Ellis, Daniel P. W.

  • fYear
    2013
  • fDate
    20-23 Oct. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Speech enhancement requires some principle by which to distinguish speech and noise, and the most successful separation requires strong models for both speech and noise. If, however, the noise encountered differs significantly from the system´s assumptions, performance will suffer. In this work, we propose a novel speech enhancement system based on decomposing the spectrogram into sparse activation of a dictionary of target speech templates, and a low-rank background model, which makes few assumptions about the noise other than its limited spectral variation. A variation of this model specifically designed to handle transient noise intrusions is also proposed. Evaluation via BSS EVAL and PESQ show that the new approaches improve signal-to-distortion ratio in most cases and PESQ in high-noise conditions when compared to several traditional speech enhancement algorithms including log-MMSE.
  • Keywords
    least mean squares methods; speech enhancement; BSS EVAL; PESQ; dictionary sparse activation; dictionary spectrogram decomposition; log-MMSE; speech enhancement; Dictionaries; Noise; Noise measurement; Sparse matrices; Speech; Speech enhancement; Transient analysis; low-rank; robust PCA; sparse; spectrogram decomposition; speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
  • Conference_Location
    New Paltz, NY
  • ISSN
    1931-1168
  • Type

    conf

  • DOI
    10.1109/WASPAA.2013.6701883
  • Filename
    6701883