• DocumentCode
    3547790
  • Title

    Blind extraction of a dominant source from mixtures of many sources using ICA and time-frequency masking

  • Author

    Sawada, Hiroshi ; Araki, Shoko ; Mukai, Ryo ; Makino, Shoji

  • Author_Institution
    NTr Commun. Sci. Labs., NTT Corp., Kyoto, Japan
  • fYear
    2005
  • fDate
    23-26 May 2005
  • Firstpage
    5882
  • Abstract
    The paper presents a method for enhancing a target source of interest and suppressing other interference sources. The target source is assumed to be close to sensors, to have dominant power at these sensors, and to have non-Gaussianity. The enhancement is performed blindly, i.e., without knowing the total number of sources or information about each source, such as position and active time. We consider a general case where the number of sources is larger than the number of sensors. We employ a two-stage process where independent component analysis (ICA) is first employed in each frequency bin and time-frequency masking is then used to improve the performance further. We propose a new sophisticated method for selecting the target source frequency components, and also a new criterion for specifying time-frequency masks. Experimental results for simulated cocktail party situations in a room (reverberation time was 130 ms) are presented to show the effectiveness and characteristics of the proposed method.
  • Keywords
    audio signal processing; blind source separation; independent component analysis; interference suppression; time-frequency analysis; 130 ms; ICA; audio applications; blind dominant source extraction; blind source extraction; blind source separation; cocktail party situations; frequency bin; independent component analysis; interference source suppression; interference suppression; speech enhancement; target source enhancement; target source frequency components; time-frequency masking; Blind source separation; Brain; Independent component analysis; Interference suppression; Laboratories; Reverberation; Sensor phenomena and characterization; Source separation; Speech; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
  • Print_ISBN
    0-7803-8834-8
  • Type

    conf

  • DOI
    10.1109/ISCAS.2005.1465977
  • Filename
    1465977