• DocumentCode
    634117
  • Title

    Localization of multiple simultaneous speakers by combining the information from different subbands

  • Author

    Firoozabadi, Ali Dehghan ; Abutalebi, H.R.

  • Author_Institution
    Electr. & Comput. Eng. Dept., Yazd Univ., Yazd, Iran
  • fYear
    2013
  • fDate
    14-16 May 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Time Difference Of Arrival (TDOA)-based algorithms are the main methods for speech source localization. One category of these methods is based on Generalized Cross Correlation (GCC). These methods estimate the source location from the TDOA calculated between microphone signals. Their accuracy decreases as the amount of noise and reverberation increases. In this paper, we propose the use of subband processing for the localization of two simultaneous speech sources. While conventional methods treat the whole signal spectrum identically in the localization procedure, the proposed method takes advantage of the differences between the frequency bands of the mixed speech to localize multiple speakers. The proposed method computes the GCC in different frequency bands and then combines the information from the subbands in a so-called smart manner. We discuss several approaches for combining the subbands. Performance evaluations in different environmental conditions demonstrate the superiority of the proposed method over the fullband GCC method. The proposed method considerably increases the accuracy of simultaneous speaker localization.
  • Keywords
    correlation methods; direction-of-arrival estimation; microphones; speaker recognition; GCC; TDOA-based algorithm; generalized cross correlation; microphones signal; performance evaluation; signal spectrum; simultaneous speaker localization; speech source localization; subband processing; time difference of arrival-based algorithm; Accuracy; Direction-of-arrival estimation; Estimation; Histograms; Microphones; Noise measurement; Speech; DOA; Generalized Cross Correlation; Multi Source Localization; PHAT filter; Subband Processing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Electrical Engineering (ICEE), 2013 21st Iranian Conference on
  • Conference_Location
    Mashhad
  • Type

    conf

  • DOI
    10.1109/IranianCEE.2013.6599672
  • Filename
    6599672
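
The abstract describes computing the GCC separately in several frequency bands and then combining the subband information. The following is a minimal sketch of that general idea, not the authors' implementation: it assumes PHAT weighting (listed in the keywords), rectangular frequency-domain band masks, and a plain sum of the subband curves as the combination rule. The function names `gcc_phat_subbands` and `estimate_tdoa`, the band edges, and the summation rule are all illustrative assumptions; the record does not specify the paper's actual filterbank or combination approaches, and the sketch picks only a single peak rather than separating two speakers.

```python
import numpy as np


def gcc_phat_subbands(x1, x2, fs, band_edges_hz, max_lag):
    """Return one GCC-PHAT curve per subband over lags -max_lag..max_lag.

    Hypothetical helper: a positive lag means x2 is delayed relative to x1.
    Requires max_lag >= 1 and max_lag < len(x1) + len(x2).
    """
    n = len(x1) + len(x2)  # zero-pad so the circular correlation does not wrap
    X1 = np.fft.rfft(x1, n)
    X2 = np.fft.rfft(x2, n)

    # Cross-power spectrum with PHAT weighting (keep phase, drop magnitude).
    cross = X2 * np.conj(X1)
    cross = cross / (np.abs(cross) + 1e-12)

    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    curves = []
    for lo, hi in zip(band_edges_hz[:-1], band_edges_hz[1:]):
        # Assumed band split: keep only the bins inside [lo, hi).
        mask = (freqs >= lo) & (freqs < hi)
        cc = np.fft.irfft(np.where(mask, cross, 0.0), n)
        # Reorder so index 0 is lag -max_lag and the last index is +max_lag.
        curves.append(np.concatenate((cc[-max_lag:], cc[:max_lag + 1])))
    return np.array(curves)


def estimate_tdoa(curves, max_lag):
    """Combine subband curves by simple summation (an assumed rule) and
    return the lag, in samples, of the highest combined peak."""
    combined = curves.sum(axis=0)
    return int(np.argmax(combined)) - max_lag
```

A typical call would pass two equal-rate microphone frames, for example `curves = gcc_phat_subbands(x1, x2, 16000, [0, 2000, 4000, 8000], 20)` followed by `estimate_tdoa(curves, 20)`. Inspecting the individual rows of `curves` instead of their sum is where a per-band weighting or voting scheme could be substituted.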