DocumentCode
3333742
Title
Active speech source localization by a dual coarse-to-fine search
Author
Duraiswami, Rainani ; Zotkin, Dmitry ; Davis, Larry S.
Author_Institution
Inst. for Adv. Comput. Studies, Maryland Univ., College Park, MD, USA
Volume
5
fYear
2001
fDate
2001
Firstpage
3309
Abstract
Accurate and fast localization of multiple speech sound sources is a significant problem in videoconferencing systems. Based on the observation that the wavelengths of the sound from a speech source are comparable to the dimensions of the space being searched, and that the source is broadband, we develop an efficient search strategy that finds the source(s) in a given space. The search is made efficient by using coarse-to-fine strategies in both space and frequency. The algorithm is shown to be robust compared to typical delay-based estimators and fast enough for real-time implementation. Its performance can be further improved by using constraints from computer vision
Keywords
array signal processing; speech processing; teleconferencing; active speech source localization; array signal processing; delay-based estimators; dual coarse-to-fine search; frequency; multiple speech sound sources; real-time implementation; space; teleconferencing; videoconferencing systems; Array signal processing; Computer interfaces; Delay effects; Delay estimation; Inverse problems; Laboratories; Position measurement; Sensor arrays; Signal processing algorithms; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940366
Filename
940366
Link To Document