DocumentCode
667537
Title
Speech enhancement by sparse, low-rank, and dictionary spectrogram decomposition
Author
Zhuo Chen ; Ellis, Daniel P. W.
fYear
2013
fDate
20-23 Oct. 2013
Firstpage
1
Lastpage
4
Abstract
Speech enhancement requires some principle by which to distinguish speech and noise, and the most successful separation requires strong models for both speech and noise. If, however, the noise encountered differs significantly from the system´s assumptions, performance will suffer. In this work, we propose a novel speech enhancement system based on decomposing the spectrogram into sparse activation of a dictionary of target speech templates, and a low-rank background model, which makes few assumptions about the noise other than its limited spectral variation. A variation of this model specifically designed to handle transient noise intrusions is also proposed. Evaluation via BSS EVAL and PESQ show that the new approaches improve signal-to-distortion ratio in most cases and PESQ in high-noise conditions when compared to several traditional speech enhancement algorithms including log-MMSE.
Keywords
least mean squares methods; speech enhancement; BSS EVAL; PESQ; dictionary sparse activation; dictionary spectrogram decomposition; log-MMSE; speech enhancement; Dictionaries; Noise; Noise measurement; Sparse matrices; Speech; Speech enhancement; Transient analysis; low-rank; robust PCA; sparse; spectrogram decomposition; speech enhancement;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
Conference_Location
New Paltz, NY
ISSN
1931-1168
Type
conf
DOI
10.1109/WASPAA.2013.6701883
Filename
6701883
Link To Document