DocumentCode
634117
Title
Localization of multiple simultaneous speakers by combining the information from different subbands
Author
Firoozabadi, Ali Dehghan ; Abutalebi, H.R.
Author_Institution
Electr. & Comput. Eng. Dept., Yazd Univ., Yazd, Iran
fYear
2013
fDate
14-16 May 2013
Firstpage
1
Lastpage
6
Abstract
Time Difference Of Arrival (TDOA)-based algorithms are the main methods for speech source localization. A category of these methods are based on Generalized Cross Correlation (GCC). These methods estimate the source location based on the calculated TDOA between microphones signals. The accuracy of these methods decreases as the amount of noise and reverberation increases. In this paper, we propose the utilization of subband processing for the localization of two simultaneous speech sources. While the conventional methods consider the whole signal spectrum identically in the localization procedure, the proposed method takes advantage of the differences in the frequency bands of the mixed speech for the localization of multiple speakers. Actually, the proposed method computes the GCC in the different frequency bands and then, combines the information from the subbands in a so-called smart manner. We have discussed several approaches for the combination of subband. The performance evaluations in different environmental conditions demonstrate the superiority of the proposed method compared to the fullband GCC method. The proposed method considerably increases the accuracy of simultaneous speaker localization.
Keywords
correlation methods; direction-of-arrival estimation; microphones; speaker recognition; GCC; TDOA-based algorithm; generalized cross correlation; microphones signal; performance evaluation; signal spectrum; simultaneous speaker localization; speech source localization; subband processing; time difference of arrival-based algorithm; Accuracy; Direction-of-arrival estimation; Estimation; Histograms; Microphones; Noise measurement; Speech; DOA; Generalized Cross Correlation; Multi Source Localization; PHAT filter; Subband Processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical Engineering (ICEE), 2013 21st Iranian Conference on
Conference_Location
Mashhad
Type
conf
DOI
10.1109/IranianCEE.2013.6599672
Filename
6599672
Link To Document