مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech segregation based on sound localization

DocumentCode :

1748960

Title :

Speech segregation based on sound localization

Author :

Roman, Nicoleta ; Wang, DeLiang ; Brown, Guy J.

Author_Institution :

Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA

Volume :

fYear :

2001

fDate :

2001

Firstpage :

2861

Abstract :

We study the cocktail-party effect, which refers to the ability of a listener to attend to a single talker in the presence of adverse acoustical conditions. It has been observed that this ability improves in the presence of binaural cues. In this paper, we explore a technique for speech segregation based on sound localization cues. The auditory masking phenomenon motivates an “ideal” binary mask in which time-frequency regions that correspond to the weak signal are cancelled. In our model we estimate this binary mask by observing that systematic changes of the interaural time differences and intensity differences occur as the energy ratio of the original signals is modified. The performance of our model is comparable with results obtained using the ideal binary mask and it shows a large improvement over existing pitch-based algorithms

Keywords :

hearing; physiological models; speech processing; time-frequency analysis; binary mask; binaural cues; energy ratio; sound localization; speech segregation; time-frequency regions; Acoustic noise; Auditory system; Azimuth; Frequency estimation; Image analysis; Information science; Noise figure; Psychoacoustic models; Source separation; Speech enhancement;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on

Conference_Location :

Washington, DC

ISSN :

1098-7576

Print_ISBN :

0-7803-7044-9

Type :

conf

DOI :

10.1109/IJCNN.2001.938830

Filename :

938830

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1748960