DocumentCode :
1721314
Title :
Binaural and Multiple-Microphone Signal Processing Motivated by Auditory Perception
Author :
Stern, Richard M. ; Gouvêa, Evandro ; Kim, Chanwoo ; Kumar, Kshitiz ; Park, Hyung-Min
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA
fYear :
2008
Firstpage :
98
Lastpage :
103
Abstract :
It is well known that binaural processing is very useful for separating incoming sound sources as well as for improving the intelligibility of speech in reverberant environments. This paper describes and compares a number of ways in which the classic model of interaural cross-correlation proposed by Jeffress, quantified by Colburn, and further elaborated by Blauert, Lindemann, and others, can be applied to improving the accuracy of automatic speech recognition systems operating in cluttered, noisy, and reverberant environments. Typical implementations begin with an abstraction of cross-correlation of the incoming signals after nonlinear monaural bandpass processing, but there are many alternative implementation choices that can be considered. Typical implementations differ in the ways in which an enhanced version of the desired signal is developed using binaural principles, in the extent to which specific processing mechanisms are used to impose suppression motivated by the precedence effect, and in the precise mechanism used to extract interaural time differences.
Keywords :
audio signal processing; correlation methods; hearing; microphones; reverberation; source separation; speech intelligibility; speech recognition; auditory perception; automatic speech recognition system; binaural signal processing; cross-correlation; multiple-microphone signal processing; reverberant environment; sound source separation; speech intelligibility; Acoustic noise; Acoustic signal processing; Auditory system; Delay; Ear; Frequency estimation; Signal processing; Speech processing; Speech recognition; Working environment noise; auditory models; binaural hearing; reverberation; robust speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008
Conference_Location :
Trento
Print_ISBN :
978-1-4244-2337-8
Electronic_ISBN :
978-1-4244-2338-5
Type :
conf
DOI :
10.1109/HSCMA.2008.4538697
Filename :
4538697
Link To Document :
بازگشت