DocumentCode
1351840
Title
Discovering Time-Constrained Sequential Patterns for Music Genre Classification
Author
Ren, Jia-Min ; Jang, Jyh-Shing Roger
Author_Institution
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Volume
20
Issue
4
fYear
2012
fDate
5/1/2012 12:00:00 AM
Firstpage
1134
Lastpage
1144
Abstract
A music piece can be considered as a sequence of sound events which represent both short-term and long-term temporal information. However, in the task of automatic music genre classification, most of text-categorization-based approaches could only capture temporal local dependencies (e.g., unigram and bigram-based occurrence statistics) to represent music contents. In this paper, we propose the use of time-constrained sequential patterns (TSPs) as effective features for music genre classification. First of all, an automatic language identification technique is performed to tokenize each music piece into a sequence of hidden Markov model indices. Then TSP mining is applied to discover genre-specific TSPs, followed by the computation of occurrence frequencies of TSPs in each music piece. Finally, support vector machine classifiers are employed based on these occurrence frequencies to perform the classification task. Experiments conducted on two widely used datasets for music genre classification, GTZAN and ISMIR2004Genre, show that the proposed method can discover more discriminative temporal structures and achieve a better recognition accuracy than the unigram and bigram-based statistical approach.
Keywords
data mining; feature extraction; hidden Markov models; music; natural languages; pattern classification; text analysis; GTZAN; ISMIR2004Genre; automatic language identification technique; automatic music genre classification; bigram-based statistical approach; discriminative temporal structure; genre-specific TSP mining; hidden Markov model index; music content; music piece; sound events sequence; support vector machine classifier; temporal information; temporal local dependency; text-categorization-based approach; time-constrained sequential pattern discovery; Computer science; Data mining; Educational institutions; Feature extraction; Hidden Markov models; Music; Data mining; hidden Markov model (HMM); music genre classification; time-constrained sequential pattern (TSP);
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2011.2172426
Filename
6047569
Link To Document