• DocumentCode
    3333742
  • Title

    Active speech source localization by a dual coarse-to-fine search

  • Author

    Duraiswami, Ramani ; Zotkin, Dmitry ; Davis, Larry S.

  • Author_Institution
    Inst. for Adv. Comput. Studies, Maryland Univ., College Park, MD, USA
  • Volume
    5
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    3309
  • Abstract
    Accurate and fast localization of multiple speech sound sources is a significant problem in videoconferencing systems. Based on the observation that the wavelengths of the sound from a speech source are comparable to the dimensions of the space being searched, and that the source is broadband, we develop an efficient search strategy that finds the source(s) in a given space. The search is made efficient by using coarse-to-fine strategies in both space and frequency. The algorithm is shown to be robust compared to typical delay-based estimators and fast enough for real-time implementation. Its performance can be further improved by using constraints from computer vision.
  • Keywords
    array signal processing; speech processing; teleconferencing; active speech source localization; delay-based estimators; dual coarse-to-fine search; frequency; multiple speech sound sources; real-time implementation; space; videoconferencing systems; Computer interfaces; Delay effects; Delay estimation; Inverse problems; Laboratories; Position measurement; Sensor arrays; Signal processing algorithms; Speech
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940366
  • Filename
    940366