• DocumentCode
    2559839
  • Title

    Particle Filter Algorithm for Single Speaker Tracking with Audio-Video Data Fusion

  • Author

    Lim, Yoon Seob ; Choi, Jong-Suk ; Kim, Munsang

  • Author_Institution
    Korea Inst. of Sci. & Technol., Seoul
  • fYear
    2007
  • fDate
    26-29 Aug. 2007
  • Firstpage
    363
  • Lastpage
    367
  • Abstract
    In this paper, a single speaker tracking mechanism with particle filter is presented for a reliable interaction with robot. To have a smooth tracking result, we use not only audio data but also video data. For sound observation, we use Time Delay Of Arrival (TDOA) between two microphones, and in order to expect high reliability in the likelihood of localization, we apply variable sensitivity values according to the angle of arrival for a triangle arrangement of microphones to calculate the likelihood for audio observation. For video observation, we use face detection routine of OpenCV which is widely available through the Internet. Together with audio and video data, we found out that more reliable source tracking is possible if we apply data fusion mechanism at every possible environment on a robot platform than single information.
  • Keywords
    audio signal processing; direction-of-arrival estimation; face recognition; man-machine systems; microphones; particle filtering (numerical methods); robots; sensor fusion; speaker recognition; video signal processing; Internet; audio-video data fusion; face detection; microphone; particle filter algorithm; reliable robot interaction; single speaker tracking; source tracking; time delay of arrival; variable sensitivity value; Acoustic noise; Cognitive robotics; Delay effects; Equations; Human robot interaction; Loudspeakers; Microphones; Particle filters; Particle tracking; Robot sensing systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robot and Human interactive Communication, 2007. RO-MAN 2007. The 16th IEEE International Symposium on
  • Conference_Location
    Jeju
  • Print_ISBN
    978-1-4244-1634-9
  • Electronic_ISBN
    978-1-4244-1635-6
  • Type

    conf

  • DOI
    10.1109/ROMAN.2007.4415110
  • Filename
    4415110