• DocumentCode
    3510777
  • Title

    Multimodal blind source separation for moving sources

  • Author

    Naqvi, S.M. ; Zhang, Y. ; Chambers, J.A.

  • Author_Institution
    Dept. of Electron. & Electr. Eng., Loughborough Univ., Loughborough
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    125
  • Lastpage
    128
  • Abstract
    A novel multimodal approach is proposed to solve the problem of blind source separation (BSS) of moving sources. The challenge of BSS for moving sources is that the mixing filters are time varying, thus the unmixing filters should also be time varying, which are difficult to track in real time. In the proposed approach, the visual modality is utilized to facilitate the separation for both stationary and moving sources. The movement of the sources is detected by a 3-D tracker based on particle filtering. The full BSS solution is formed by integrating a frequency domain blind source separation algorithm and beamforming: if the sources are identified as stationary, a frequency domain BSS algorithm is implemented with an initialization derived from the visual information. Once the sources are moving, a beamforming algorithm is used to perform real time speech enhancement and provide separation of the sources. Experimental results show that by utilizing the visual modality, the proposed algorithm can not only improve the performance of the BSS algorithm and mitigate the permutation problem for stationary sources, but also provide a good BSS performance for moving sources in a low reverberant environment.
  • Keywords
    blind source separation; particle filtering (numerical methods); speech enhancement; 3D tracker; beamforming algorithm; frequency domain blind source separation algorithm; mixing filter; moving sources; multimodal blind source separation; particle filtering; permutation problem; real time speech enhancement; visual modality; Array signal processing; Blind source separation; Cameras; Data mining; Filtering; Filters; Frequency domain analysis; Particle tracking; Signal processing algorithms; Source separation; 3-D tracking; BSS; FastICA; beamforming; multimodal signal processing; particle filtering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4959536
  • Filename
    4959536