Title :
Online REPET-SIM for real-time speech enhancement
Author :
Rafii, Zafar ; Pardo, Bryan
Author_Institution :
EECS Dept., Northwestern Univ., Evanston, IL, USA
Abstract :
REPET-SIM is a generalization of the REpeating Pattern Extraction Technique (REPET) that uses a similarity matrix to separate the repeating background from the non-repeating foreground in a mixture. The method assumes that the background (typically the music accompaniment) is dense and low-rank, while the foreground (typically the singing voice) is sparse and varied. This assumption often holds for background music and foreground voice in musical mixtures, and it also often holds for background noise and foreground speech in noisy mixtures. We therefore propose to extend REPET-SIM to noise/speech segregation. In particular, given the low computational complexity of the algorithm, we show that the method can be easily implemented online for real-time processing. Evaluation on a data set of 10 two-channel mixtures of speech and real-world background noise showed that this online REPET-SIM can be successfully applied to real-time speech enhancement, performing as well as several competing methods.
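The abstract gives no implementation details, so the following is only an illustrative sketch of the idea it describes, not the authors' code. It assumes a non-negative magnitude spectrogram as input, cosine similarity between frames, and a median over the most similar past frames as the background estimate. The function name `online_repet_sim_mask` and the parameters `buffer_len` and `k` are hypothetical.

```python
import numpy as np

def online_repet_sim_mask(V, buffer_len=50, k=10):
    """Sketch: soft mask of the repeating background, frame by frame.

    V          -- magnitude spectrogram, shape (freq_bins, frames), non-negative
    buffer_len -- number of most recent frames kept for comparison (assumed)
    k          -- number of most similar frames used for the median (assumed)
    """
    n_bins, n_frames = V.shape
    eps = 1e-12
    # Unit-normalize each frame so that dot products are cosine similarities.
    Vn = V / (np.linalg.norm(V, axis=0, keepdims=True) + eps)
    mask = np.zeros_like(V, dtype=float)
    for j in range(n_frames):
        # Causal buffer: only the current frame and frames before it.
        start = max(0, j - buffer_len + 1)
        past = np.arange(start, j + 1)
        # One column of the similarity matrix, restricted to the buffer.
        sims = Vn[:, past].T @ Vn[:, j]
        # Indices of the k most similar frames (fewer while the buffer fills).
        top = past[np.argsort(sims)[-k:]]
        # Median across similar frames suppresses sparse, varied foreground.
        background = np.median(V[:, top], axis=1)
        # The background cannot hold more energy than the mixture itself.
        background = np.minimum(background, V[:, j])
        mask[:, j] = background / (V[:, j] + eps)
    return mask
```

Under these assumptions, multiplying the mixture's complex spectrogram by `1 - mask` would give an estimate of the foreground speech, and multiplying by `mask` an estimate of the background noise. Because each frame depends only on a bounded buffer of past frames, the per-frame cost stays fixed, which is what makes an online variant plausible.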
Keywords :
computational complexity; pattern recognition; real-time systems; speech enhancement; background music; background noise; foreground speech; foreground voice; music accompaniment; noise-speech segregation; nonrepeating foreground; online REPET-SIM; real-time processing; real-time speech enhancement; repeating background; repeating pattern extraction technique; similarity matrix; singing voice; Estimation; Noise; Noise measurement; Speech; Time-frequency analysis; Blind source separation; real-time; repeating patterns
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6637768