• DocumentCode
    799585
  • Title

    Binaural Sound Source Distance Learning in Rooms

  • Author

    Vesa, Sampo

  • Author_Institution
    Dept. of Media Technol., Helsinki Univ. of Technol. (TKK), Espoo, Finland
  • Volume
    17
  • Issue
    8
  • fYear
    2009
  • Firstpage
    1498
  • Lastpage
    1507
  • Abstract
    A method for learning the distance of a sound source in a room is presented. The proposed method is based on short-time magnitude-squared coherence between the two channels of a binaural signal. Based on white noise as the training data, a coherence profile is obtained at each desired position in the room. These profiles can then be used to identify the most likely distance of a speech signal in the same room. The proposed approach is compared to a previous method for learning the position of a sound source. The results indicate that the both methods are able to identify the distance of a speech sound source correctly in a grid with 0.5-m spacing in most cases, when the orientation of the listener is 0deg , 30deg , 60deg , 90deg , or 180deg on the horizontal plane.
  • Keywords
    speech processing; binaural sound source; distance learning; short-time magnitude-squared coherence; sound source position; speech signal distance; Binaural signal; coherence; distance measurement; localization;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2009.2022001
  • Filename
    4907086