Regional Information Center for Science and Technology - Online REPET-SIM for real-time speech enhancement

DocumentCode :

1654163

Title :

Online REPET-SIM for real-time speech enhancement

Author :

Rafii, Zafar ; Pardo, Bryan

Author_Institution :

EECS Dept., Northwestern Univ., Evanston, IL, USA

Year :

2013

Firstpage :

848

Lastpage :

852

Abstract :

REPET-SIM is a generalization of the REpeating Pattern Extraction Technique (REPET) that uses a similarity matrix to separate the repeating background from the non-repeating foreground in a mixture. The method assumes that the background (typically the music accompaniment) is dense and low-ranked, while the foreground (typically the singing voice) is sparse and varied. While this assumption is often true for background music and foreground voice in musical mixtures, it also often holds for background noise and foreground speech in noisy mixtures. We therefore propose here to extend REPET-SIM for noise/speech segregation. In particular, given the low computational complexity of the algorithm, we show that the method can be easily implemented online for real-time processing. Evaluation on a data set of 10 two-channel mixtures of speech and real-world background noise showed that this online REPET-SIM can be successfully applied for real-time speech enhancement, performing as well as different competitive methods.
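
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea it describes, not the authors' code. It assumes a magnitude spectrogram as input, cosine similarity between time frames, a median over the most similar past frames as the background estimate, and a soft time-frequency mask. The function name and the parameters `buffer_size` and `num_similar` are hypothetical choices made for illustration.

```python
import numpy as np

def online_repet_sim_mask(V, buffer_size=50, num_similar=5):
    """Background soft mask for a magnitude spectrogram V (freq x time).

    Illustrative assumption: each frame is compared only with the frames
    held in a sliding buffer of past frames, so the loop is causal.
    """
    V = np.asarray(V, dtype=float)
    n_frames = V.shape[1]
    M = np.empty_like(V)
    for t in range(n_frames):
        # Sliding buffer: the current frame plus up to buffer_size - 1 past frames.
        start = max(0, t - buffer_size + 1)
        B = V[:, start:t + 1]
        v = V[:, t]
        # Cosine similarity between the current frame and every buffered frame.
        denom = np.linalg.norm(B, axis=0) * np.linalg.norm(v)
        sims = (B.T @ v) / np.maximum(denom, 1e-12)
        # Indices of the most similar buffered frames (the current frame included).
        k = min(num_similar, B.shape[1])
        idx = np.argsort(-sims)[:k]
        # Repeating (background) estimate: element-wise median of those frames,
        # capped by the mixture because the background cannot exceed it.
        w = np.minimum(np.median(B[:, idx], axis=1), v)
        # Soft mask in [0, 1]; values near 1 mark background-dominated bins.
        M[:, t] = w / np.maximum(v, 1e-12)
    return M
```

Under these assumptions, multiplying the mixture's short-time Fourier transform by the mask would give a background (noise) estimate, and multiplying by one minus the mask would give a foreground (speech) estimate. The buffer length trades memory and latency against how far back repetitions can be found.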

Keywords :

computational complexity; pattern recognition; real-time systems; speech enhancement; background music; background noise; foreground speech; foreground voice; music accompaniment; noise-speech segregation; nonrepeating foreground; online REPET-SIM; real-time processing; real-time speech enhancement; repeating background; repeating pattern extraction technique; similarity matrix; singing voice; Estimation; Noise; Noise measurement; Speech; Time-frequency analysis; Blind source separation; real-time; repeating patterns

Language :

English

Publisher :

IEEE

Conference_Title :

2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6637768

Filename :

6637768

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1654163