DocumentCode :
3510711
Title :
An algorithm for speech segregation of co-channel speech
Author :
Vishnubhotla, Srikanth ; Espy-Wilson, Carol Y.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Maryland, College Park, MD
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
109
Lastpage :
112
Abstract :
This paper introduces an algorithm to separate speech streams from a single-channel speech mixture. Most current speech segregation algorithms allocate speech regions to participating speakers depending on which speaker dominates each spectro-temporal region. The proposed method takes a different approach to speech segregation, in that it separates the participating speaker streams rather than deciding in favor of the dominating speaker. The algorithm relies on a least-squares fitting approach that models the speech mixture as a sum of complex exponentials. It gives better results than an existing algorithm when tested on the same task. On a different database, it yielded good segregation results, even for Target-to-Masker ratios as low as -15 dB. The algorithm shows promise for further improvement and practical implementation.
Keywords :
least squares approximations; speech intelligibility; speech processing; co-channel speech segregation; least-squares fitting approach; spectro-temporal region; Databases; Educational institutions; Image analysis; Loudspeakers; Signal processing; Spectrogram; Speech analysis; Speech enhancement; Speech processing; Testing; Target-to-Masker Ratio; auditory scene analysis; co-channel speech; monaural speech; speech segregation; speech separation
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4959532
Filename :
4959532