DocumentCode :
1118143
Title :
Single-Mixture Audio Source Separation by Subspace Decomposition of Hilbert Spectrum
Author :
Molla, Md Khademul Islam ; Hirose, Keikichi
Author_Institution :
Graduate Sch. of Inf. Sci. & Technol., Univ. of Tokyo
Volume :
15
Issue :
3
fYear :
2007
fDate :
3/1/2007 12:00:00 AM
Firstpage :
893
Lastpage :
900
Abstract :
A novel technique is developed to separate the audio sources from a single mixture. The method is based on decomposing the Hilbert spectrum (HS) of the mixed signal into independent source subspaces. Hilbert transform combined with empirical mode decomposition (EMD) constitutes HS, which is a fine-resolution time-frequency representation of a nonstationary signal. The EMD represents any time-domain signal as the sum of a finite set of oscillatory components called intrinsic mode functions (IMFs). After computing the spectral projections between the mixed signal and the individual IMF components, the projection vectors are used to derive a set of spectral independent bases by applying principal component analysis (PCA) and independent component analysis (ICA). A k-means clustering algorithm based on Kulback-Leibler divergence (KLd) is introduced to group the independent basis vectors into the number of component sources inside the mixture. The HS of the mixed signal is projected onto the space spanned by each group of basis vectors yielding the independent source subspaces. The time-domain source signals are reconstructed by applying the inverse transformation. Experimental results show that the proposed algorithm performs separation of speech and interfering sound from a single mixture
Keywords :
Hilbert transforms; audio signal processing; independent component analysis; principal component analysis; signal reconstruction; signal representation; source separation; time-frequency analysis; Hilbert spectrum; Hilbert transform; Kulback-Leibler divergence; empirical mode decomposition; fine-resolution time-frequency representation; independent component analysis; intrinsic mode functions; k-means clustering algorithm; nonstationary signal; principal component analysis; single-mixture audio source separation; spectral projections; subspace decomposition; time-domain signal; Clustering algorithms; Independent component analysis; Information science; Instruction sets; Layout; Principal component analysis; Source separation; Speech; Time domain analysis; Time frequency analysis; Empirical mode decomposition (EMD); Hilbert spectrum; Kullback- Leibler divergence; independent component analysis (ICA);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2006.885254
Filename :
4100684
Link To Document :
بازگشت