DocumentCode :
1787527
Title :
Performance evaluation of single channel speech separation using non-negative matrix factorization
Author :
Nandakumar, M. Mona ; Bijoy, K. Edet
Author_Institution :
Dept. of Electron. & Commun. Eng., MES Coll. of Eng., Kuttippuram, India
fYear :
2014
fDate :
10-12 Oct. 2014
Firstpage :
1
Lastpage :
4
Abstract :
Blind Source Separation (BSS) of underdetermined mixture has acquired a huge attention in signal processing environment, even though it is very much difficult to separate the underlying sources. The difficulty in source separation arise due to the mixing of large number of source signals in time and frequency, and propagation of it to one or more sensors through air. The objective in BSS is to identify the underlying source signals based on measurements of the mixed sources. Among many of the techniques used in BSS, due to its direct, easy to code and intuitive interpretability of basis and activation components, NMF provide an accurate form of parts-based representation of underlying data. Even though both supervised and unsupervised modes of operations are used in NMF, supervised mode performs well due to the use of pre-learned basis vectors corresponding to each underlying source. In this paper two of the multiplicative algorithms, Regularized Expectation Minimization Maximum Likelihood Algorithm (REMML) and Regularized Image Space Reconstruction Algorithm (RISRA) with sparseness constraint are taken to evaluate the performance of BSS. By the use of speech and music mixtures, Signal to Distortion Ratio (SDR), Signal to Interference Ratio (SIR) and Signal to Artifact Ratio (SAR) are evaluated.
Keywords :
blind source separation; expectation-maximisation algorithm; matrix decomposition; performance evaluation; speech processing; BSS; NMF; REMML; RISRA; SAR; SDR; SIR; activation component; blind source separation; intuitive interpretability; mixed source signal; multiplicative algorithm; nonnegative matrix factorization; performance evaluation; prelearned basis vector; regularized expectation minimization maximum likelihood algorithm; regularized image space reconstruction algorithm; signal processing; signal to artifact ratio; signal to distortion ratio; signal to interference ratio; single channel speech separation; sparseness constraint; supervised operation mode; Cost function; Performance evaluation; Principal component analysis; Source separation; Sparse matrices; Speech; Vectors; BSS Evaluation; Blind Source Separation; EMML; ISRA; NMF; Source Separation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communication, Signal Processing and Networking (NCCSN), 2014 National Conference on
Conference_Location :
Palakkad
Type :
conf
DOI :
10.1109/NCCSN.2014.7001159
Filename :
7001159
Link To Document :
بازگشت