Regional Information Center for Science and Technology - Monaural Speech Separation Based on Computational Auditory Scene Analysis and Objective Quality Assessment of Speech

DocumentCode :

788296

Title :

Monaural Speech Separation Based on Computational Auditory Scene Analysis and Objective Quality Assessment of Speech

Author :

Li, Peng ; Guan, Yong ; Xu, Bo ; Liu, Wenju

Author_Institution :

Inst. of Autom., Chinese Acad. of Sci., Beijing

Volume :

Issue :

Year :

2006

Firstpage :

2014

Lastpage :

2023

Abstract :

Monaural speech separation is a very challenging problem in speech signal processing. It has been studied extensively, and many separation systems based on computational auditory scene analysis (CASA) have been proposed in the last two decades. Although the research on CASA has tended to introduce high-level knowledge into separation processes using primitive data-driven methods, the knowledge on speech quality still has not been combined with it. This makes the performance evaluation of CASA mainly focused on the signal-to-noise ratio (SNR) improvement. Actually, the quality of the separated speech is not directly related to its SNR. In order to solve this problem, we propose a new method which combines CASA with objective quality assessment of speech (OQAS). In the grouping process of CASA, we use OQAS as the guide to instruct the CASA system. With this combination, the performance of the speech separation can be improved not only in SNR, but also in mean opinion score (MOS). Our system is systematically evaluated and compared with previous systems, and it yields substantially better performance, especially for the subjective perceptual quality of separated speech.

Keywords :

hearing; speech processing; SNR; computational auditory scene analysis; mean opinion score; monaural speech separation; objective quality assessment of speech; signal-to-noise ratio; speech signal processing; Auditory system; Automatic speech recognition; Automation; Humans; Image analysis; Quality assessment; Separation processes; Signal processing; Speech analysis; Speech processing; Computational auditory scene analysis (CASA); grouping; monaural speech separation; objective quality assessment of speech (OQAS); segmentation

Language :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2006.883258

Filename :

1709891

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=788296