DocumentCode :
667537
Title :
Speech enhancement by sparse, low-rank, and dictionary spectrogram decomposition
Author :
Zhuo Chen ; Ellis, Daniel P. W.
fYear :
2013
fDate :
20-23 Oct. 2013
Firstpage :
1
Lastpage :
4
Abstract :
Speech enhancement requires some principle by which to distinguish speech and noise, and the most successful separation requires strong models for both speech and noise. If, however, the noise encountered differs significantly from the system´s assumptions, performance will suffer. In this work, we propose a novel speech enhancement system based on decomposing the spectrogram into sparse activation of a dictionary of target speech templates, and a low-rank background model, which makes few assumptions about the noise other than its limited spectral variation. A variation of this model specifically designed to handle transient noise intrusions is also proposed. Evaluation via BSS EVAL and PESQ show that the new approaches improve signal-to-distortion ratio in most cases and PESQ in high-noise conditions when compared to several traditional speech enhancement algorithms including log-MMSE.
Keywords :
least mean squares methods; speech enhancement; BSS EVAL; PESQ; dictionary sparse activation; dictionary spectrogram decomposition; log-MMSE; speech enhancement; Dictionaries; Noise; Noise measurement; Sparse matrices; Speech; Speech enhancement; Transient analysis; low-rank; robust PCA; sparse; spectrogram decomposition; speech enhancement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
Conference_Location :
New Paltz, NY
ISSN :
1931-1168
Type :
conf
DOI :
10.1109/WASPAA.2013.6701883
Filename :
6701883
Link To Document :
بازگشت