مرکز منطقه ای اطلاع رساني علوم و فناوري - An algorithm for speech segregation of co-channel speech

DocumentCode :

3510711

Title :

An algorithm for speech segregation of co-channel speech

Author :

Vishnubhotla, Srikanth ; Espy-Wilson, Carol Y.

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Maryland, College Park, MD

fYear :

2009

fDate :

19-24 April 2009

Firstpage :

109

Lastpage :

112

Abstract :

This paper introduces an algorithm to separate speech streams from a single-channel speech mixture. Most current speech segregation algorithms allocate speech regions to participating speakers depending on which speaker dominates in which spectro-temporal region. The proposed method is a different approach to speech segregation, in that it separates the participating speaker streams rather than decide in the favor of the dominating speaker. The algorithm depends on a lease-squares fitting approach to model the speech mixture as a sum of complex exponentials. The algorithm gives results that are better than an existent algorithm when tested on the same task. The performance on a different database yielded good segregation results, even for Target-to-Masker ratios as low as -15 dB. The algorithm has immense promise for improvement and practical implementation.

Keywords :

least squares approximations; speech intelligibility; speech processing; co-channel speech segregation; lease-squares fitting approach; spectro-temporal region; Databases; Educational institutions; Image analysis; Loudspeakers; Signal processing; Spectrogram; Speech analysis; Speech enhancement; Speech processing; Testing; Target-to-Masker Ratio; auditory scene analysis; co-channel speech; monaural speech; speech segregation; speech separation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

Conference_Location :

Taipei

ISSN :

1520-6149

Print_ISBN :

978-1-4244-2353-8

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2009.4959532

Filename :

4959532

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3510711