Title :
Audio-Based Estimation of Speakers Directions for Multimedia Meeting Logs
Author :
Yokoe, Yuki ; Ito, Yoshimichi ; Babaguchi, Noboru
Author_Institution :
Osaka Univ., Osaka
Abstract :
This paper is concerned with an audio-based method for estimating speaker directions in meeting environment. It is well-known that cross-power spectrum phase (CSP) analysis is a very powerful tool for localizing sound sources. However, when we adopt the CSP-based method together with a circular microphone array system to estimate the speaker directions in 360-degree range (e.g. round-table discussions), the method fails to estimate the directions due to the existence of imaginary peaks of CSP coefficients. In order to circumvent the above problem, we propose a method to suppress the imaginary peaks, which uses a circular-array version of the method proposed by Nishiura and appropriate scaling around the imaginary peaks. Experimental results are also shown to demonstrate the effectiveness of the proposed method.
Keywords :
audio signal processing; direction-of-arrival estimation; microphone arrays; speaker recognition; audio-based estimation; circular microphone array system; circular-array; cross-power spectrum phase analysis; imaginary peaks suppression; multimedia meeting logs; speaker directions estimation; Acoustical engineering; Cameras; Delay effects; Delay estimation; Indium tin oxide; Intelligent systems; Loudspeakers; Microphone arrays; Power engineering and energy; Prototypes;
Conference_Titel :
Multimedia and Expo, 2007 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
1-4244-1016-9
Electronic_ISBN :
1-4244-1017-7
DOI :
10.1109/ICME.2007.4284624