• DocumentCode
    1837906
  • Title

    Direction of arrival estimation for speech sources using fourth order cross cumulants

  • Author

    Swartling, Mikael ; Sallberg, Benny ; Grbic, Nedelko

  • Author_Institution
    Dept. of Signal Process., Blekinge Inst. of Technol., Ronneby
  • fYear
    2008
  • fDate
    18-21 May 2008
  • Firstpage
    1696
  • Lastpage
    1699
  • Abstract
    In many applications where speech separation and enhancement is of interest, e.g. conferencing systems, mobile phones and hearing aids, accurate speaker localization is important. This paper presents an alternative criteria for the well known steered response power with phase transform (SRP-PHAT) algorithm, in which the steered response relates to peaks in the fourth order cross cumulant, rather than peaks in the second order cross cumulant, i.e. the cross power spectrum. Since speech sources have a probability density function (PDF) close to the Laplacian distribution and noise are generally closer to the Gaussian distribution, the fourth order cumulant becomes a good alternative for the steered response search for speech sources. The proposed method is evaluated and compared to the original SRP-PHAT algorithm and shows significant improvements in localization performance for speech sources.
  • Keywords
    Gaussian distribution; Laplace transforms; direction-of-arrival estimation; speaker recognition; speech enhancement; Gaussian distribution; Laplacian distribution; direction of arrival estimation; fourth order cross cumulants; phase transform; probability density function; speaker localization; speech enhancement; speech separation; speech sources; steered response power; Acoustic sensors; Direction of arrival estimation; Gaussian noise; Laplace equations; Probability density function; Sensor arrays; Signal processing algorithms; Speech enhancement; Speech processing; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2008. ISCAS 2008. IEEE International Symposium on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    978-1-4244-1683-7
  • Electronic_ISBN
    978-1-4244-1684-4
  • Type

    conf

  • DOI
    10.1109/ISCAS.2008.4541763
  • Filename
    4541763