DocumentCode
2592455
Title
Sound source separation for a robot based on pitch
Author
Heckmann, Martin ; Joublin, Frank ; Körner, Edgar
Author_Institution
Honda Res. Inst. Eur. GmbH, Offenbach, Germany
fYear
2005
fDate
2-6 Aug. 2005
Firstpage
2197
Lastpage
2202
Abstract
We present a novel method for the separation of monaurally recorded speech signals based on pitch. Our method is inspired by the ability of some auditory neurons to phase lock with the excitation signal. After applying a Gammatone filter-bank on the original signal we compare the distances between zero crossings of possible harmonics and decide upon the result of this comparison if they share the same fundamental and hence originate from the same sound source. For higher frequencies we use the amplitude modulation property of unresolved harmonics to determine their fundamental frequency. When comparing our method to standard autocorrelation based methods we see that the pitch can be tracked more precisely and especially opens the way to extract also the pitch contour of a second speaker or other sound sources which can be of importance for the robots behavior. Tests in sound source separation of our algorithm on a database with several speakers and a large set of intrusions show that our algorithm performs slightly better than the commonly used autocorrelation at lower computational costs.
Keywords
amplitude modulation; audio signal processing; robots; source separation; speech processing; Gammatone filter-bank; amplitude modulation; monaural sound source separation; pitch estimation; robot behavior; speech signals; zero crossing distances; Acoustic testing; Amplitude modulation; Autocorrelation; Frequency; Loudspeakers; Neurons; Power harmonic filters; Robots; Source separation; Speech; Amplitude Modulation; Histogram; Monaural Sound Source Separation; Zero Crossing Distances;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Robots and Systems, 2005. (IROS 2005). 2005 IEEE/RSJ International Conference on
Print_ISBN
0-7803-8912-3
Type
conf
DOI
10.1109/IROS.2005.1544982
Filename
1544982
Link To Document