DocumentCode
2389720
Title
Learning sound location from a single microphone
Author
Saxena, Ashutosh ; Ng, Andrew Y.
Author_Institution
Comput. Sci. Dept., Stanford Univ., Stanford, CA, USA
fYear
2009
fDate
12-17 May 2009
Firstpage
1737
Lastpage
1742
Abstract
We consider the problem of estimating the incident angle of a sound, using only a single microphone. The ability to perform monaural (single-ear) localization is important to many animals; indeed, monaural cues are also the primary method by which humans decide if a sound comes from the front or back, as well as estimate its elevation. Such monaural localization is made possible by the structure of the pinna (outer ear), which modifies sound in a way that is dependent on its incident angle. In this paper, we propose a machine learning approach to monaural localization, using only a single microphone and an ldquoartificial pinnardquo (that distorts sound in a direction-dependent way). Our approach models the typical distribution of natural and artificial sounds, as well as the direction-dependent changes to sounds induced by the pinna. Our experimental results also show that the algorithm is able to fairly accurately localize a wide range of sounds, such as human speech, dog barking, waterfall, thunder, and so on. In contrast to microphone arrays, this approach also offers the potential of significantly more compact, as well as lower cost and power, devices for sounds localization.
Keywords
acoustic signal processing; learning (artificial intelligence); artificial pinna; incident angle estimation; machine learning approach; monaural localization; single microphone; sound location learning; sounds localization; Biological systems; Computer science; Costs; Ear; Humans; Microphone arrays; Organisms; Robotics and automation; Robots; USA Councils;
fLanguage
English
Publisher
ieee
Conference_Titel
Robotics and Automation, 2009. ICRA '09. IEEE International Conference on
Conference_Location
Kobe
ISSN
1050-4729
Print_ISBN
978-1-4244-2788-8
Electronic_ISBN
1050-4729
Type
conf
DOI
10.1109/ROBOT.2009.5152861
Filename
5152861
Link To Document