DocumentCode :
3195176
Title :
Audio-Based Estimation of Speakers Directions for Multimedia Meeting Logs
Author :
Yokoe, Yuki ; Ito, Yoshimichi ; Babaguchi, Noboru
Author_Institution :
Osaka Univ., Osaka
fYear :
2007
fDate :
2-5 July 2007
Firstpage :
212
Lastpage :
215
Abstract :
This paper is concerned with an audio-based method for estimating speaker directions in meeting environment. It is well-known that cross-power spectrum phase (CSP) analysis is a very powerful tool for localizing sound sources. However, when we adopt the CSP-based method together with a circular microphone array system to estimate the speaker directions in 360-degree range (e.g. round-table discussions), the method fails to estimate the directions due to the existence of imaginary peaks of CSP coefficients. In order to circumvent the above problem, we propose a method to suppress the imaginary peaks, which uses a circular-array version of the method proposed by Nishiura and appropriate scaling around the imaginary peaks. Experimental results are also shown to demonstrate the effectiveness of the proposed method.
Keywords :
audio signal processing; direction-of-arrival estimation; microphone arrays; speaker recognition; audio-based estimation; circular microphone array system; circular-array; cross-power spectrum phase analysis; imaginary peaks suppression; multimedia meeting logs; speaker directions estimation; Acoustical engineering; Cameras; Delay effects; Delay estimation; Indium tin oxide; Intelligent systems; Loudspeakers; Microphone arrays; Power engineering and energy; Prototypes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2007 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
1-4244-1016-9
Electronic_ISBN :
1-4244-1017-7
Type :
conf
DOI :
10.1109/ICME.2007.4284624
Filename :
4284624
Link To Document :
بازگشت