Title :
Audio genre classification using percussive pattern clustering combined with timbral features
Author :
Tsunoo, Emiru ; Tzanetakis, George ; Ono, Nobutaka ; Sagayama, Shigeki
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fDate :
June 28 2009-July 3 2009
Abstract :
Many musical genres and styles are characterized by distinct representative rhythmic patterns. In most automatic genre classification systems global statistical features based on timbral dynamics such as mel-frequency cepstral coefficients (MFCC) are utilized but so far rhythmic information has not so effectively been used. In order to extract bar-long unit rhythmic patterns for a music collection we propose a clustering method based on one-pass dynamic programming and k-means clustering. After extracting the fundamental rhythmic patterns for each style/genre a pattern occurrence histogram is calculated and used as a feature vector for supervised learning. Experimental results show that the automatically calculated rhythmic pattern information can be used to effectively classify musical genre/style and improve upon current approaches based on timbral features.
Keywords :
audio acoustics; audio signal processing; dynamic programming; feature extraction; information retrieval; musical acoustics; pattern clustering; signal classification; audio genre classification; feature vector; global statistical features; k-means clustering; mel-frequency cepstral coefficients; one-pass dynamic programming; percussive pattern clustering; rhythmic pattern information; supervised learning; timbral features; Clustering algorithms; Data mining; Dynamic programming; Feature extraction; Histograms; Iterative algorithms; Multiple signal classification; Pattern clustering; Rhythm; Spectrogram; Audio genre classification; Dynamic programming; Feature extraction; Pattern clustering method; Percussive sound;
Conference_Titel :
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4244-4290-4
Electronic_ISBN :
1945-7871
DOI :
10.1109/ICME.2009.5202514