• DocumentCode
    2923669
  • Title

    Voice source localization for automatic camera pointing system in videoconferencing

  • Author

    Wang, Hong ; Chu, Peter

  • Author_Institution
    PictureTel Corp. M/S, Andover, MA, USA
  • Volume
    1
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    187
  • Abstract
    This paper describes the voice source localization algorithm used in the PictureTel automatic camera pointing system (LimeLight™, dynamic speech locating technology). The system uses a microphone array 46 cm wide and 30 cm high, containing 4 microphones, mounted on top of the monitor. The three-dimensional position of a sound source is calculated from the time delays of 4 pairs of microphones. In time delay estimation, the averaging of signal onsets in each frequency band is combined with phase correlation to reduce the influence of noise and reverberation. With this approach, a small microphone array can provide reliable three-dimensional voice source localization. Post-processing based on a priori knowledge is also introduced to eliminate the influence of reflections from furniture such as tables. Results of speech source localization under real conference room conditions are given. Some system-related issues are also discussed.
  • Keywords
    acoustic signal detection; delays; microphones; position control; teleconferencing; video cameras; LimeLight; PictureTel automatic camera pointing system; dynamic speech locating technology; noise; phase correlation; reverberation; small microphone array; sound source; time delay estimation; videoconferencing; voice source localization; Acoustic noise; Cameras; Delay effects; Delay estimation; Frequency estimation; Microphone arrays; Monitoring; Noise reduction; Phase estimation; Speech
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.599595
  • Filename
    599595
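
The abstract names phase correlation as one ingredient of the time delay estimation between microphone pairs. As a rough illustration only, the sketch below estimates the delay between two microphone signals with a generic phase correlation (often called GCC-PHAT). It is not the paper's algorithm: the per-band onset averaging and the post-processing are not reproduced, and the function name `phase_correlation_delay`, the 16 kHz sampling rate, and the 343 m/s speed of sound are assumptions introduced for this example.

```python
import numpy as np


def phase_correlation_delay(x, y, fs, max_delay_s=None):
    """Estimate how much y lags x, in seconds, by phase correlation.

    Hypothetical helper for illustration; a generic GCC-PHAT sketch,
    not the method described in the paper.
    """
    n = len(x) + len(y)  # zero-pad so the circular correlation does not wrap
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = Y * np.conj(X)
    cross /= np.abs(cross) + 1e-12  # discard magnitude, keep phase only
    cc = np.fft.irfft(cross, n=n)

    # Limit the search to physically possible lags when a bound is given.
    max_shift = n // 2
    if max_delay_s is not None:
        max_shift = max(1, min(max_shift, int(round(max_delay_s * fs))))

    # Reorder so that index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = int(np.argmax(cc)) - max_shift
    return lag / fs


# Synthetic check: y is x delayed by 5 samples.
fs = 16000                       # assumed sampling rate in Hz
rng = np.random.default_rng(0)
x = rng.standard_normal(1024)
true_lag = 5
y = np.concatenate((np.zeros(true_lag), x[:-true_lag]))

# Largest possible delay across a 46 cm aperture, assuming c = 343 m/s.
max_delay_s = 0.46 / 343.0
delay = phase_correlation_delay(x, y, fs, max_delay_s=max_delay_s)
```

Bounding the lag search by the array aperture discards correlation peaks that no direct-path source could produce, which is one simple way to limit the influence of reflections; the delays from several microphone pairs would then feed a separate position solver, which is outside this sketch.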