Title :
An SVM based classification approach to speech separation
Author :
Han, Kun ; Wang, DeLiang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
Monaural speech separation is a very challenging task. CASA-based systems utilize acoustic features to produce a time-frequency (T-F) mask. In this study, we propose a classification approach to monaural separation problem. Our feature set consists of pitch-based features and amplitude modulation spectrum features, which can discriminate both voiced and unvoiced speech from nonspeech interference. We employ support vector machines (SVMs) followed by a re-thresholding method to classify each T-F unit as either target-dominated or interference-dominated. An auditory segmentation stage is then utilized to improve SVM-generated results. Systematic evaluations show that our approach produces high quality binary masks and outperforms a previous system in terms of classification accuracy.
Keywords :
speech processing; support vector machines; CASA- based system; SVM; SVM based classification approach; T-F mask; acoustic features; amplitude modulation spectrum features; auditory segmentation stage; high quality binary masks; nonspeech interference; speech separation; support vector machines; time-frequency mask; unvoiced speech interference; Accuracy; Feature extraction; Noise; Speech; Support vector machines; Time frequency analysis; Training; IBM; Re-thresholding; SVM; Segmentation; Speech separation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947387