DocumentCode
799585
Title
Binaural Sound Source Distance Learning in Rooms
Author
Vesa, Sampo
Author_Institution
Dept. of Media Technol., Helsinki Univ. of Technol. (TKK), Espoo, Finland
Volume
17
Issue
8
fYear
2009
Firstpage
1498
Lastpage
1507
Abstract
A method for learning the distance of a sound source in a room is presented. The proposed method is based on the short-time magnitude-squared coherence between the two channels of a binaural signal. Using white noise as the training data, a coherence profile is obtained at each desired position in the room. These profiles can then be used to identify the most likely distance of a speech signal in the same room. The proposed approach is compared to a previous method for learning the position of a sound source. The results indicate that both methods are able to identify the distance of a speech sound source correctly in a grid with 0.5-m spacing in most cases, when the orientation of the listener is 0°, 30°, 60°, 90°, or 180° in the horizontal plane.
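The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: the frame length, hop size, Hann window, averaging over frames, and the nearest-profile rule (smallest mean squared difference) are all assumptions made for illustration, and the names `dft_half`, `msc`, and `nearest_profile` are hypothetical. The sketch computes a magnitude-squared coherence profile between the left and right channels and picks the trained distance whose profile is closest.

```python
import cmath
import math


def dft_half(frame):
    """Naive DFT of a real frame, returning bins 0..N/2 (adequate for short frames)."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n // 2 + 1)
    ]


def msc(left, right, frame_len=32, hop=16):
    """Magnitude-squared coherence per frequency bin.

    Auto- and cross-spectra are accumulated over Hann-windowed frames
    (frame_len and hop are illustrative defaults, not values from the paper).
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * t / frame_len)
              for t in range(frame_len)]
    bins = frame_len // 2 + 1
    sxx = [0.0] * bins
    syy = [0.0] * bins
    sxy = [0j] * bins
    last_start = min(len(left), len(right)) - frame_len
    for start in range(0, last_start + 1, hop):
        x = dft_half([left[start + t] * window[t] for t in range(frame_len)])
        y = dft_half([right[start + t] * window[t] for t in range(frame_len)])
        for k in range(bins):
            sxx[k] += abs(x[k]) ** 2
            syy[k] += abs(y[k]) ** 2
            sxy[k] += x[k] * y[k].conjugate()
    eps = 1e-12  # guards against division by zero for silent input
    return [abs(sxy[k]) ** 2 / (sxx[k] * syy[k] + eps) for k in range(bins)]


def nearest_profile(profile, trained):
    """Return the distance label whose trained profile is closest.

    `trained` maps a distance label to a coherence profile. The mean
    squared difference used here is an assumed matching rule.
    """
    def cost(reference):
        return sum((p - r) ** 2 for p, r in zip(profile, reference)) / len(profile)

    return min(trained, key=lambda label: cost(trained[label]))
```

In such a setup, `trained` would hold one profile per grid position, each computed from a white-noise recording at that position, and `nearest_profile` would be applied to the profile of a speech recording made in the same room.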
Keywords
speech processing; binaural sound source; distance learning; short-time magnitude-squared coherence; sound source position; speech signal distance; binaural signal; coherence; distance measurement; localization
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2009.2022001
Filename
4907086