Separation of Singing Voice Using Nonnegative Matrix Partial Co-Factorization for Singer Identification

Author

Ying Hu ; Guizhong Liu

Author_Institution

Sch. of Electron. & Inf. Eng., Xian Jiaotong Univ., Xian, China

Volume

23

Issue

4

fYear

2015

fDate

Apr-15

Firstpage

643

Lastpage

653

Abstract

In order to improve the performance of singer identification, we propose a system to separate singing voice from music accompaniment for monaural recordings. Our system consists of two key stages. The first stage exploits the nonnegative matrix partial co-factorization (NMPCF), which is a joint matrix decomposition integrating prior knowledge of singing voice and pure accompaniment to separate the mixture signal into singing voice portion and accompaniment portion. In the second stage, based on the separated singing voice obtained by the first stage, the pitches of singing voice are first estimated and then the harmonic components of singing voice can be distinguished. For a frame, the distinguished harmonic components are regarded as reliable while other frequency components unreliable, thus the spectrum is incomplete. With those harmonic components, the complete spectrums of singing voice can be reconstructed by a missing feature method, spectrum reconstruction, obtaining a refined signal with more clean singing voice. Experimental results demonstrate that, from the point view of source separation, the singing voice refinement can further improve ΔSNR in contrast with the singing voice separation using NMPCF, while for the point view of singer identification, the singing voice separated by NMPCF is more appropriate than the refined singing voice.

Keywords

matrix decomposition; music; signal reconstruction; source separation; speaker recognition; NMPCF; accompaniment portion; frequency components; harmonic components; joint matrix decomposition; missing feature method; mixture signal separation; monaural recordings; music accompaniment; nonnegative matrix partial co-factorization; refined signal; singer identification; singing voice portion; singing voice refinement; singing voice separation; source separation; spectrum reconstruction; Feature extraction; Harmonic analysis; IEEE transactions; Instruments; Matrix decomposition; Source separation; Spectrogram; Nonnegative matrix partial co-factorization (NMPCF); singer identification; singing voice separation; spectrum reconstruction;

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE/ACM Transactions on

Publisher

ieee

ISSN

2329-9290

Type

jour

DOI

10.1109/TASLP.2015.2396681

Filename

7021947