DocumentCode :
2899069
Title :
A hidden Markov model based approach to music segmentation and identification
Author :
Sheng Gao ; Maddage, N.C. ; Lee, Chin-Hui
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
Volume :
3
fYear :
2003
fDate :
15-18 Dec. 2003
Firstpage :
1576
Abstract :
Classification of musical segments is an interesting problem. It is a key technology in the development of content-based audio document indexing and retrieval. In this paper, we apply the feature extraction and modeling techniques commonly used in automatic speech recognition to solving the problem of segmentation and instrument identification of musical passages. The correlation among the different components in the feature space and the auto-correlation of each component are analyzed to demonstrate feasibility in musical signal analysis and instrument class modeling. Our experimental results are first evaluated on 3 instrument categories, i.e. vocal music, instrumental music, and their combinations. Furthermore each category is split into two individual cases to give a 6-class problem. Our results show that good performance could be obtained with simple features, such as mel-frequency cepstral coefficients and cepstral coefficients derived from linear prediction signal analysis. Even with a limited amount of training data, we could give an accuracy of 90.60% in the case of three categories. A slightly worse accuracy of 90.38% is obtained when we double the number of categories to six classes.
Keywords :
audio signal processing; cepstral analysis; content-based retrieval; correlation methods; hidden Markov models; indexing; music; signal classification; speech recognition; audio indexing; audio retrieval; automatic speech recognition; cepstral coefficients; component correlations; feature extraction; hidden Markov model; instrumental music; linear prediction signal analysis; music segmentation; musical passage identification; musical segment classification; musical signal analysis; time-varying signals; vocal music; Automatic speech recognition; Cepstral analysis; Content based retrieval; Feature extraction; Hidden Markov models; Indexing; Instruments; Music information retrieval; Signal analysis; Space technology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information, Communications and Signal Processing, 2003 and Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint Conference of the Fourth International Conference on
Print_ISBN :
0-7803-8185-8
Type :
conf
DOI :
10.1109/ICICS.2003.1292732
Filename :
1292732
Link To Document :
بازگشت