• DocumentCode
    2270122
  • Title

    Robust feature selection for scaling ambiguity reduction in audio-visual convolutive BSS

  • Author

    Qingju Liu ; Naqvi, Syed Mohsen ; Wenwu Wang ; Jackson, Philip ; Chambers, Jonathon

  • Author_Institution
    Dept. of Electron. Eng., Univ. of Surrey, Guildford, UK
  • fYear
    2011
  • fDate
    Aug. 29 2011-Sept. 2 2011
  • Firstpage
    1060
  • Lastpage
    1064
  • Abstract
    Information from video has been used recently to address the issue of scaling ambiguity in convolutive blind source separation (BSS) in the frequency domain, based on statistical modeling of the audio-visual coherence with Gaussian mixture models (GMMs) in the feature space. However, outliers in the feature space may greatly degrade the system performance in both training and separation stages. In this paper, a new feature selection scheme is proposed to discard non-stationary features, which improves the robustness of the coherence model and reduces its computational complexity. The scaling parameters obtained by coherence maximization and non-linear interpolation from the selected features are applied to the separated frequency components to mitigate the scaling ambiguity. A multimodal database composed of different combinations of vowels and consonants was used to test our algorithm. Experimental results show the performance improvement with our proposed algorithm.
  • Keywords
    Gaussian processes; audio-visual systems; blind source separation; feature selection; frequency-domain analysis; interference suppression; mixture models; GMM; Gaussian mixture model; audiovisual coherence model; audiovisual convolutive BSS; blind source separation; coherence maximization; feature selection scheme; feature space; frequency components; frequency domain analysis; multimodal database; nonlinear interpolation; scaling ambiguity reduction; scaling parameters; statistical modeling; Coherence; Feature extraction; Frequency-domain analysis; Source separation; Speech; Training; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2011 19th European
  • Conference_Location
    Barcelona
  • ISSN
    2076-1465
  • Type

    conf

  • Filename
    7074127