DocumentCode :
427168
Title :
Singing voice detection using twice-iterated composite Fourier transform
Author :
Maddage, Namunu Chinthaka ; Wan, Kongwah ; Xu, Changsheng ; Ye Wang
Author_Institution :
Inst. for Infocomm Res., Singapore
Volume :
2
fYear :
2004
fDate :
30-30 June 2004
Firstpage :
1347
Abstract :
In this paper, we propose a twice-iterated composite Fourier transform (TICFT) technique to detect the singing voice boundaries from acoustical polyphonic music signals. We show that the cumulative TICFT energy in the lower coefficients is capable of differentiating the harmonic structures of vocal and instrumental music in higher octaves. The musical signal is first segmented into frames based on quarter-notes. Then TICFT is used to measure the harmonic structure of each frame. Finally, the vocal and instrumental frames are classified by applying music domain knowledge. Experimental results show over 80% frame level accuracy can be achieved
Keywords :
Fourier transforms; audio signal processing; music; acoustical polyphonic music signals; cumulative TICFT energy; instrumental frame classification; quarter-notes; singing voice boundaries; singing voice detection; twice-iterated composite Fourier transform; vocal frame classification; Acoustic measurements; Acoustic signal detection; Band pass filters; Fourier transforms; Instruments; Multiple signal classification; Music information retrieval; Power harmonic filters; Rhythm; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
0-7803-8603-5
Type :
conf
DOI :
10.1109/ICME.2004.1394478
Filename :
1394478
Link To Document :
بازگشت