DocumentCode :
2506432
Title :
Semi-blind Speech-Music Separation Using Sparsity and Continuity Priors
Author :
Erdogan, Hakan ; Grais, Emad M.
Author_Institution :
Fac. of Eng. & Natural Sci., Sabanci Univ., Istanbul, Turkey
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
4573
Lastpage :
4576
Abstract :
In this paper we propose an approach for the problem of single channel source separation of speech and music signals. Our approach is based on representing each source´s power spectral density using dictionaries and nonlinearly projecting the mixture signal spectrum onto the combined span of the dictionary entries. We encourage sparsity and continuity of the dictionary coefficients using penalty terms (or log-priors) in an optimization framework. We propose to use a novel coordinate descent technique for optimization, which nicely handles nonnegativity constraints and nonquadratic penalty terms. We use an adaptive Wiener filter, and spectral subtraction to reconstruct both of the sources from the mixture data after corresponding power spectral densities (PSDs) are estimated for each source. Using conventional metrics, we measure the performance of the system on simulated mixtures of single person speech and piano music sources. The results indicate that the proposed method is a promising technique for low speech-to-music ratio conditions and that sparsity and continuity priors help improve the performance of the proposed system.
Keywords :
Wiener filters; adaptive filters; blind source separation; music; speech processing; adaptive Wiener filter; continuity prior; coordinate descent technique; mixture signal spectrum; nonnegativity constraints; nonquadratic penalty terms; power spectral density; semi-blind speech-music separation; single channel source separation; sparsity prior; spectral subtraction; speech-to-music ratio conditions; Dictionaries; Multiple signal classification; Optimization; Source separation; Speech; Speech recognition; Training; semi-blind signal separation; single channel speech-music separation; sparsity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.1129
Filename :
5597375
Link To Document :
بازگشت