Title :
A new hierarchical binaural sound source localization method based on Interaural Matching Filter
Author :
Hong Liu ; Jie Zhang ; Zhuo Fu
Author_Institution :
Fac. of Key Lab. of Machine perception & Intell., Peking Univ., Shenzhen, China
fDate :
May 31 2014-June 7 2014
Abstract :
Binaural sound source localization is an important technique in friendly Human-Robot Interaction (HRI) for its easy-implementation with only two microphones. This paper develops a robust method based on a hierarchical probabilistic model. Reliable frequency sub-bands are used to PHAT - ργ method in the first layer to obtain a priori crude Interaural Time-Delay (ITD) and the probabilistic distribution of candidate azimuths. The second layer utilizes Interaural Intensity Difference (IID) to reduce matching time and refine candidate azimuths as well as elevations. A novel feature named Interaural Matching Filter (IMF), which can eliminate the difference between ITDs and IIDs, is proposed in the third layer. The probability of sound source location is acquired by computing the similarity between the IMF of received binaural signal and the IMFs in templates. Finally, combined with the former ITDs and IIDs, the similarity matrix is used to make a decision of sound source location based on a Bayes rule. Our innovation lies in adding selecting reliable frequency components into time-delay estimation and foremost taking IMF as a feature of sound source. Compared with several state-of-the-art algorithms, experimental results show our approach has a better performance even in noisy environments without increasing storages, and also has less time complexity.
Keywords :
Bayes methods; acoustic signal processing; delays; filtering theory; human-robot interaction; pattern matching; statistical distributions; Bayes rule; HRI; IID; ITD; PHAT-ργ method; binaural sound source localization method; frequency subbands; human-robot interaction; interaural intensity difference; interaural matching filter; interaural time-delay; microphones; probabilistic distribution; Azimuth; Ear; Matched filters; Noise; Probabilistic logic; Reliability; Time-frequency analysis;
Conference_Titel :
Robotics and Automation (ICRA), 2014 IEEE International Conference on
Conference_Location :
Hong Kong
DOI :
10.1109/ICRA.2014.6907065