مرکز منطقه ای اطلاع رساني علوم و فناوري - Single-Sensor Audio Source Separation Using Classification and Estimation Approach and GARCH Modeling

DocumentCode :

3558770

Title :

Single-Sensor Audio Source Separation Using Classification and Estimation Approach and GARCH Modeling

Author :

Abramson, Ari ; Cohen, Israel

Author_Institution :

Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa

Volume :

Issue :

fYear :

2008

Firstpage :

1528

Lastpage :

1540

Abstract :

In this paper, we propose a new algorithm for single-sensor audio source separation of speech and music signals, which is based on generalized autoregressive conditional heteroscedasticity (GARCH) modeling of the speech signals and Gaussian mixture modeling (GMM) of the music signals. The separation of the speech from the music signal is obtained by a simultaneous classification and estimation approach, which enables one to control the tradeoff between residual interference and signal distortion. Experimental results on mixtures of speech and piano music signals have yielded an improved source separation performance compared to using Gaussian mixture models for both signals. The tradeoff between signal distortion and residual interference is controlled by adjusting some cost parameters, which are shown to determine the missed and false detection rates in the proposed classification and estimation approach.

Keywords :

Gaussian processes; audio signal processing; autoregressive processes; blind source separation; signal classification; GARCH modeling; Gaussian mixture modeling; classification approach; estimation approach; generalized autoregressive conditional heteroscedasticity; piano music signals; residual interference; signal distortion; single-sensor audio source separation; speech signals; Background noise; Costs; Distortion; Hidden Markov models; Instruments; Interference; Microphones; Multiple signal classification; Source separation; Speech enhancement; Detection and estimation; Source separation; generalized autoregressive conditional heteroscedasticity (GARCH);

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2008.2005351

Filename :

4648925

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3558770