• DocumentCode
    178682
  • Title

    Underdetermined blind separation and tracking of moving sources based ONDOA-HMM

  • Author

    Higuchi, Tatsuro ; Takamune, Norihiro ; Nakamura, T. ; Kameoka, Hirokazu

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    3191
  • Lastpage
    3195
  • Abstract
    This paper deals with the problem of the underdetermined blind separation and tracking of moving sources. In practical situations, sound sources such as human speakers can move freely and so blind separation algorithms must be designed to track the temporal changes of the impulse responses. We propose solving this problem through the posterior inference of the parameters in a generative model of an observed multichannel signal, formulated under the assumption of the sparsity of time-frequency components of speech and the continuity of speakers´ movements. Specifically, we describe a generative model of mixture signals by incorporating a generative model of a time-varying frequency array response for each source, described using a path-restricted hidden Markov model (HMM). Each hidden state of the present HMM represents the direction of arrival (DOA) of each source, and so we call it a “DOA-HMM.” Through the posterior inference of the overall generative model, we can simultaneously track the DOAs of sources, separate source signals and perform permutation alignment. The experiment showed that the proposed algorithm provided a 6.20 dB improvement compared with the conventional method in terms of the signal-to-interference ratio.
  • Keywords
    Bayes methods; audio signal processing; blind source separation; direction-of-arrival estimation; frequency response; hidden Markov models; tracking; transient response; DOA-HMM; approximate posterior inference; direction-of-arrival hidden Markov model; impulse response; moving sources; multichannel signal; permutation alignment; signal-to-interference ratio; time-varying frequency array response; underdetermined blind separation; variational inference; Arrays; Direction-of-arrival estimation; Hidden Markov models; Microphones; Source separation; Speech; Time-frequency analysis; Underdetermined blind separation; direction of arrival; hidden Markov model; moving sources; variational inference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854189
  • Filename
    6854189