مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning to maximize signal-to-noise ratio for reverberant speech segregation

DocumentCode :

3530592

Title :

Learning to maximize signal-to-noise ratio for reverberant speech segregation

Author :

Jin, Zhaozhang ; Wang, DeLiang

Author_Institution :

Dept. of Comput. Sci., Ohio State Univ., Columbus, OH

fYear :

2009

fDate :

19-24 April 2009

Firstpage :

4689

Lastpage :

4692

Abstract :

Monaural speech segregation in reverberant environments is a very difficult problem. We develop a supervised learning approach by proposing an objective function that directly relates to the computational goal of maximizing signal-to-noise ratio. The model trained using this new objective function yields significantly better results for time-frequency unit labeling. In our segregation system, a segmentation and grouping framework is utilized to form reliable segments under reverberant conditions and organize them into streams. Systematic evaluations show very promising results.

Keywords :

learning (artificial intelligence); speech processing; grouping framework; monaural speech segregation; objective function; reverberant speech segregation; segmentation framework; signal-to-noise ratio; supervised learning approach; time-frequency unit labeling; Filtering; Image analysis; Labeling; Power harmonic filters; Reverberation; Robustness; Signal to noise ratio; Speech; Supervised learning; Time frequency analysis; Computational auditory scene analysis; monaural speech segregation; objective function; room reverberation; supervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

Conference_Location :

Taipei

ISSN :

1520-6149

Print_ISBN :

978-1-4244-2353-8

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2009.4960677

Filename :

4960677

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3530592