Title :
Speech enhancement via low-rank matrix decomposition and image based masking
Author :
LiYang Liu ; Zhaogui Ding ; Weifeng Li ; Longbiao Wang ; Qingmin Liao
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Shenzhen, China
Abstract :
Speech enhancement is an important task in many applications such as speech recognition. Conventional methods always require some principles by which to distinguish speech and noise and the most successful enhancement requires strong models for both speech and noise. However, if the noise actually encountered differs significantly from the system´s assumptions, performance will rapidly declines. In this work, we propose an unsupervised speech enhancement system based on decomposing the frequency-time spectrogram into a sparse foreground speech and a low-rank background noise, which makes few assumptions about the noise other than its limited spectral variation. An image based masking is also designed to handle the poor performance of noise removing when using spectrogram decomposition only. Evaluations via PESQ and SegSNR show that the new approach improves signal-to-distortion ratio and PESQ in most cases when compared to several traditional speech enhancement algorithms.
Keywords :
matrix decomposition; signal denoising; speech enhancement; speech recognition; PESQ; SegSNR; frequency-time spectrogram; image based masking; low-rank background noise; low-rank matrix decomposition; noise removal; signal-to-distortion ratio; sparse foreground speech; spectrogram decomposition; speech recognition; unsupervised speech enhancement system; Matrix decomposition; Noise; Noise measurement; Sparse matrices; Spectrogram; Speech; Speech enhancement; RPCA; low-rank; sparse; spectrogram decomposition; speech enhancement;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
DOI :
10.1109/ISCSLP.2014.6936687