DocumentCode
427168
Title
Singing voice detection using twice-iterated composite Fourier transform
Author
Maddage, Namunu Chinthaka ; Wan, Kongwah ; Xu, Changsheng ; Ye Wang
Author_Institution
Inst. for Infocomm Res., Singapore
Volume
2
fYear
2004
fDate
30-30 June 2004
Firstpage
1347
Abstract
In this paper, we propose a twice-iterated composite Fourier transform (TICFT) technique to detect the singing voice boundaries from acoustical polyphonic music signals. We show that the cumulative TICFT energy in the lower coefficients is capable of differentiating the harmonic structures of vocal and instrumental music in higher octaves. The musical signal is first segmented into frames based on quarter-notes. Then TICFT is used to measure the harmonic structure of each frame. Finally, the vocal and instrumental frames are classified by applying music domain knowledge. Experimental results show over 80% frame level accuracy can be achieved
Keywords
Fourier transforms; audio signal processing; music; acoustical polyphonic music signals; cumulative TICFT energy; instrumental frame classification; quarter-notes; singing voice boundaries; singing voice detection; twice-iterated composite Fourier transform; vocal frame classification; Acoustic measurements; Acoustic signal detection; Band pass filters; Fourier transforms; Instruments; Multiple signal classification; Music information retrieval; Power harmonic filters; Rhythm; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Conference_Location
Taipei
Print_ISBN
0-7803-8603-5
Type
conf
DOI
10.1109/ICME.2004.1394478
Filename
1394478
Link To Document