DocumentCode
1377026
Title
Zero-crossing-based speech segregation and recognition for humanoid robots
Author
An, Sung Jun ; Kil, Rhee Man ; Kim, Young-Ik
Author_Institution
Dept. of Math. Sci., Korea Adv. Inst. of Sci. & Technol. (KAIST), Daejeon, South Korea
Volume
55
Issue
4
fYear
2009
fDate
11/1/2009 12:00:00 AM
Firstpage
2341
Lastpage
2348
Abstract
Nowadays, humanoid robots attract people since their overall appearance is similar to the human body, allowing interaction with humans and the surrounding environment. In the case of the auditory interaction with humans, it is desirable that humanoid robots have similar capacity to the human¿s auditory information processing system. This is a very difficult task, since current automatic speech recognition (ASR) systems are not quite robust to noise and it¿s hard to attend to the selected speech source. In this context, this paper presents a new method of zero-crossing based binaural mask estimation for speech segregation and recognition, when multiple sound sources are present simultaneously. The proposed method provides high performance of speech segregation and recognition while offers significantly less computational complexity compared to the conventional methods based on cross-correlation. We expect that this method would be able to provide an effective tool for the auditory interaction with humanoid robots using the sensory information of binaural sounds.
Keywords
estimation theory; humanoid robots; speech intelligibility; speech recognition; auditory interaction; binaural mask estimation; humanoid robot; speech recognition; zero-crossing-based speech segregation; Acoustic noise; Automatic speech recognition; Humanoid robots; Humans; Information processing; Noise robustness; Speech coding; Speech enhancement; Speech recognition; Working environment noise; zero-crossings, sound source localization, speech segregation, speech recognition;
fLanguage
English
Journal_Title
Consumer Electronics, IEEE Transactions on
Publisher
ieee
ISSN
0098-3063
Type
jour
DOI
10.1109/TCE.2009.5373808
Filename
5373808
Link To Document