مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech enhancement by sparse, low-rank, and dictionary spectrogram decomposition

DocumentCode :

667537

Title :

Speech enhancement by sparse, low-rank, and dictionary spectrogram decomposition

Author :

Zhuo Chen ; Ellis, Daniel P. W.

fYear :

2013

fDate :

20-23 Oct. 2013

Firstpage :

Lastpage :

Abstract :

Speech enhancement requires some principle by which to distinguish speech and noise, and the most successful separation requires strong models for both speech and noise. If, however, the noise encountered differs significantly from the system´s assumptions, performance will suffer. In this work, we propose a novel speech enhancement system based on decomposing the spectrogram into sparse activation of a dictionary of target speech templates, and a low-rank background model, which makes few assumptions about the noise other than its limited spectral variation. A variation of this model specifically designed to handle transient noise intrusions is also proposed. Evaluation via BSS EVAL and PESQ show that the new approaches improve signal-to-distortion ratio in most cases and PESQ in high-noise conditions when compared to several traditional speech enhancement algorithms including log-MMSE.

Keywords :

least mean squares methods; speech enhancement; BSS EVAL; PESQ; dictionary sparse activation; dictionary spectrogram decomposition; log-MMSE; speech enhancement; Dictionaries; Noise; Noise measurement; Sparse matrices; Speech; Speech enhancement; Transient analysis; low-rank; robust PCA; sparse; spectrogram decomposition; speech enhancement;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on

Conference_Location :

New Paltz, NY

ISSN :

1931-1168

Type :

conf

DOI :

10.1109/WASPAA.2013.6701883

Filename :

6701883

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=667537