• DocumentCode
    867368
  • Title

    Speech Separation of Multiple Moving Speakers Using Multisensor Multitarget Techniques

  • Author

    Potamitis, Ilyas ; Kokkinakis, George

  • Author_Institution
    Dept. of Music Technol. & Acoust., Technol. Educ. Inst. of Crete, Rethymno
  • Volume
    37
  • Issue
    1
  • fYear
    2007
  • Firstpage
    72
  • Lastpage
    81
  • Abstract
    The general problem addressed in this paper is that of separating the voices of active moving speakers in the presence of background noise and moderate reverberation level in the acoustic field using a single microphone array. We adapt the multisensor multitarget tracking theory to the context of microphone arrays in order to form receptive beams that lock on each moving speaker on an extended time basis and therefore, achieve voice separation. Our approach: 1) incorporates kinematical information of speakers´ movement by using an interacting multiple model (IMM) estimator per speaker in order to constrain the evolution of direction of arrival (DOA) measurements, which characterize various motions of the speakers, and 2) can directly account for measurement origin uncertainty, i.e., which measurement comes from which speaker, by using the probabilistic-data-association technique in conjunction with the IMM estimator. The effectiveness of the approach is illustrated by an extensive simulation study on tracking the DOAs of two speakers with crossing trajectories and three static speakers having a conversation with partially overlapping speech and long pauses
  • Keywords
    direction-of-arrival estimation; microphone arrays; sensor fusion; speech processing; target tracking; direction of arrival measurements; multiple moving speakers; multisensor multitarget techniques; probabilistic-data-association technique; single microphone array; speech separation; speech signal processing; Acoustic arrays; Adaptive arrays; Background noise; Direction of arrival estimation; Loudspeakers; Measurement uncertainty; Microphone arrays; Motion estimation; Reverberation; Speech; Microphone array applications; speaker direction of arrival (DOA) tracking; speech signal processing; voice separation;
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4427
  • Type

    jour

  • DOI
    10.1109/TSMCA.2006.886338
  • Filename
    4032927