DocumentCode
788453
Title
Blind Extraction of Dominant Target Sources Using ICA and Time-Frequency Masking
Author
Sawada, Hiroshi ; Araki, Shoko ; Mukai, Ryo ; Makino, Shoji
Author_Institution
NTT Commun. Sci. Lab., NTT Corp., Kyoto
Volume
14
Issue
6
fYear
2006
Firstpage
2165
Lastpage
2173
Abstract
This paper presents a method for enhancing target sources of interest and suppressing other interference sources. The target sources are assumed to be close to sensors, to have dominant powers at these sensors, and to have non-Gaussianity. The enhancement is performed blindly, i.e., without knowing the position and active time of each source. We consider a general case where the total number of sources is larger than the number of sensors, and neither the number of target sources nor the total number of sources is known. The method is based on a two-stage process where independent component analysis (ICA) is first employed in each frequency bin and then time-frequency masking is used to improve the performance further. We propose a new sophisticated method for deciding the number of target sources and then selecting their frequency components. We also propose a new criterion for specifying time-frequency masks. Experimental results for simulated cocktail party situations in a room, whose reverberation time was 130 ms, are presented to show the effectiveness and characteristics of the proposed method
Keywords
blind source separation; feature extraction; independent component analysis; interference suppression; time-frequency analysis; ICA; blind extraction; dominant target sources; independent component analysis; interference sources suppression; time-frequency masking; Blind source separation; Frequency domain analysis; Helium; Independent component analysis; Interference suppression; Reverberation; Sensor phenomena and characterization; Source separation; Speech processing; Time frequency analysis; Blind source extraction; blind source separation (BSS); convolutive mixture; frequency domain; independent component analysis; permutation problem; time-frequency masking;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2006.872599
Filename
1709904
Link To Document