DocumentCode
1353781
Title
Background Music Removal Based on Cepstrum Transformation for Popular Singer Identification
Author
Tsai, Wei-Ho ; Lin, Hao-Ping
Author_Institution
Dept. of Electron. Eng., Nat. Taipei Univ. of Technol., Taipei, Taiwan
Volume
19
Issue
5
fYear
2011
fDate
7/1/2011
Firstpage
1196
Lastpage
1205
Abstract
One major challenge in identifying singers in popular music recordings is reducing the interference of background accompaniment when characterizing the singer's voice. Although a number of studies on automatic singer identification (SID) from acoustic features have been reported, most systems to date do not explicitly deal with the background accompaniment. This study proposes a background accompaniment removal approach for SID that exploits the underlying relationships between solo singing voices and their accompanied versions in the cepstral domain. The relationships are characterized by a transformation estimated from a large set of accompanied singing generated by manually mixing solo singing with accompaniments extracted from Karaoke VCDs. Such a transformation reflects the cepstrum variations of a singing voice before and after accompaniments are added to it. When an unknown accompanied voice is presented to our system, the transformation is applied to convert the cepstrum of the accompanied voice into a solo-voice-like one. Our experiments show that such a background removal approach improves the SID accuracy significantly, even when a test music recording involves a sung language not covered in the data used for estimating the transformation.
Keywords
audio recording; music; Karaoke VCD; SID accuracy; acoustic features; automatic singer identification; background accompaniment removal approach; background music removal; cepstrum transformation; interference reduction; popular music recordings; popular singer identification; singer voice; solo singing voices; Cepstrum; Feature extraction; Indexes; Instruments; Materials; Testing; Training; Background accompaniment; cepstrum transformation; singer identification (SID)
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2010.2087752
Filename
5604657