DocumentCode :
3413637
Title :
On fusion of timbre-motivated features for singing voice detection and singer identification
Author :
Nwe, Tin Lay ; Li, Haizhou
Author_Institution :
Inst. for Infocomm Res., Singapore
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
2225
Lastpage :
2228
Abstract :
Timbre is the quality of sound which allows the ear to distinguish between musical sounds. In this paper, we study timbre effects in identification of singing voice segments in popular songs. Firstly, we identify between singing voice and instrumental segments in a song. Then, singing voice segments are further categorized according to their singer identity. Timbre-motivated effects are formulated by fusion of systems that use the features from vibrato, harmonic information and other features extracted using Mel and Log frequency scale filter banks. Statistical methods to select singing voice segments with high confidence measure are proposed for better performance in singer identification process. The experiments conducted on a database of 214 popular songs show that the proposed approach is effective.
Keywords :
feature extraction; music; Log frequency scale filter bank; Mel frequency scale filter bank; musical sound; singer identification process; statistical method; timbre-motivated feature; voice detection; Data mining; Ear; Feature extraction; Filter bank; Frequency; Instruments; Power harmonic filters; Spatial databases; Statistical analysis; Timbre; Singing voice; harmonic; singer identification; timbre; vibrato;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518087
Filename :
4518087
Link To Document :
بازگشت