• DocumentCode
    1403751
  • Title
    A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End
  • Author
    May, Tobias ; Van De Par, Steven ; Kohlrausch, Armin
  • Author_Institution
    Inst. of Phys., Univ. of Oldenburg, Oldenburg, Germany
  • Volume
    19
  • Issue
    1
  • fYear
    2011
  • Firstpage
    1
  • Lastpage
    13
  • Abstract
    Although extensive research has been done in the field of machine-based localization, the degrading effect of reverberation and the presence of multiple sources on localization performance has remained a major problem. Motivated by the ability of the human auditory system to robustly analyze complex acoustic scenes, the associated peripheral stage is used in this paper as a front-end to estimate the azimuth of sound sources based on binaural signals. One classical approach to localize an acoustic source in the horizontal plane is to estimate the interaural time difference (ITD) between both ears by searching for the maximum in the cross-correlation function. Apart from ITDs, the interaural level difference (ILD) can contribute to localization, especially at higher frequencies where the wavelength becomes smaller than the diameter of the head, leading to ambiguous ITD information. The interdependency of ITD and ILD on azimuth is a complex pattern that depends also on the room acoustics, and is therefore learned by azimuth-dependent Gaussian mixture models (GMMs). Multiconditional training is performed to take into account the variability of the binaural features which results from multiple sources and the effect of reverberation. The proposed localization model outperforms state-of-the-art localization techniques in simulated adverse acoustic conditions.
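    The classical cue extraction mentioned in the abstract can be illustrated with a minimal sketch: the ITD is taken as the lag that maximizes the cross-correlation of the two ear signals, and the ILD as their energy ratio in dB. This is a broadband toy example under stated assumptions, not the authors' model; the auditory front-end and the GMM stage are not reproduced, and the function names, the synthetic test signal, the sampling rate, and the sign conventions are all hypothetical choices made for illustration.

```python
import math


def estimate_itd(left, right, fs, max_lag):
    """Return the ITD in seconds as the lag that maximizes the cross-correlation.

    Assumed sign convention: a positive value means the right signal
    lags (arrives later than) the left signal.
    """
    n = len(left)
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        # Cross-correlation at this lag: sum of left[i] * right[i + lag]
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += left[i] * right[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag / fs


def estimate_ild(left, right):
    """Return the ILD in dB as an energy ratio (positive = left is louder)."""
    e_left = sum(x * x for x in left)
    e_right = sum(x * x for x in right)
    return 10.0 * math.log10(e_left / e_right)


# Synthetic binaural pair (illustrative values): a Gaussian-windowed 500 Hz tone,
# with the right channel delayed by 8 samples and attenuated by a factor of 2.
fs = 16000
n = 256
delay = 8
sig = [
    math.exp(-(((i - 128) / 40.0) ** 2)) * math.sin(2 * math.pi * 500 * i / fs)
    for i in range(n)
]
left = sig
right = [0.5 * sig[i - delay] if i >= delay else 0.0 for i in range(n)]

itd = estimate_itd(left, right, fs, max_lag=16)
ild = estimate_ild(left, right)
print(f"ITD = {itd * 1e6:.1f} us, ILD = {ild:.2f} dB")
```

    In this sketch the lag search is restricted to a plausible range (`max_lag`), because a periodic signal produces several cross-correlation peaks; this is the ITD ambiguity at higher frequencies that the abstract refers to.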
  • Keywords
    Gaussian processes; audio signal processing; correlation methods; estimation theory; probability; reverberation; Gaussian mixture models; acoustic source localisation; associated peripheral stage; binaural auditory front-end; cross-correlation function; human auditory system; interaural level difference; interaural time difference; localization performance; machine-based localization; probabilistic model; robust localization; sound sources; Auditory system; Azimuth; Degradation; Ear; Frequency; Humans; Layout; Reverberation; Robustness; Signal analysis; Localization; auditory scene analysis (ASA); binaural; interaural level difference (ILD); interaural time difference (ITD)
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TASL.2010.2042128
  • Filename
    5406118