DocumentCode :
3530592
Title :
Learning to maximize signal-to-noise ratio for reverberant speech segregation
Author :
Jin, Zhaozhang ; Wang, DeLiang
Author_Institution :
Dept. of Comput. Sci., Ohio State Univ., Columbus, OH
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4689
Lastpage :
4692
Abstract :
Monaural speech segregation in reverberant environments is a very difficult problem. We develop a supervised learning approach by proposing an objective function that directly relates to the computational goal of maximizing signal-to-noise ratio. The model trained using this new objective function yields significantly better results for time-frequency unit labeling. In our segregation system, a segmentation and grouping framework is utilized to form reliable segments under reverberant conditions and organize them into streams. Systematic evaluations show very promising results.
Keywords :
learning (artificial intelligence); speech processing; grouping framework; monaural speech segregation; objective function; reverberant speech segregation; segmentation framework; signal-to-noise ratio; supervised learning approach; time-frequency unit labeling; Filtering; Image analysis; Labeling; Power harmonic filters; Reverberation; Robustness; Signal to noise ratio; Speech; Supervised learning; Time frequency analysis; Computational auditory scene analysis; monaural speech segregation; objective function; room reverberation; supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960677
Filename :
4960677
Link To Document :
بازگشت