Title :
Content-adaptive speech enhancement by a sparsely-activated dictionary plus low rank decomposition
Author :
Zhuo Chen ; Papadopoulos, Helene ; Ellis, Daniel P. W.
Abstract :
One powerful approach to speech enhancement employs strong models for both speech and noise, decomposing a mixture into the most likely combination. But if the noise encountered differs significantly from the system´s assumptions, performance will suffer. In previous work, we proposed a speech enhancement model that decomposes the spectrogram into sparse activation of a dictionary of target speech templates, and a low-rank background model. This makes few assumptions about the noise, and gave appealing results on small excerpts of noisy speech. However, when processing whole conversations, the foreground speech may vary in its complexity and may be unevenly distributed throughout the recording, resulting in inaccurate decompositions for some segments. In this paper, we explore an adaptive formulation of our previous model that incorporates separate side information to guide the decomposition, making it able to better process entire conversations that may exhibit large variations in the speech content.
Keywords :
learning (artificial intelligence); sparse matrices; speech enhancement; content-adaptive speech enhancement; conversation processing; foreground speech; low-rank decomposition; noisy speech; sparsely-activated dictionary; speech content; Adaptation models; Dictionaries; Noise; Noise measurement; Spectrogram; Speech; Speech enhancement; low-rank; robust PCA; sparse; spectrogram decomposition; speech enhancement; voice activity detection;
Conference_Titel :
Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on
Conference_Location :
Villers-les-Nancy
DOI :
10.1109/HSCMA.2014.6843242