DocumentCode
1314993
Title
Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines
Author
Tsunoo, Emiru ; Tzanetakis, George ; Ono, Nobutaka ; Sagayama, Shigeki
Author_Institution
Grad. Sch. of Infor mation Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
Volume
19
Issue
4
fYear
2011
fDate
5/1/2011 12:00:00 AM
Firstpage
1003
Lastpage
1014
Abstract
This paper discusses a new approach for clustering sequences of bar-long percussive and bass-line patterns in audio music collections and its application to genre classification. Many musical genres and styles are characterized by two kinds of distinct representative patterns, i.e., percussive patterns and bass-line patterns. So far, in most automatic genre classification systems, rhythmic and bass melody information has not been effectively used. In order to extract bar-long unit rhythmic patterns for a music collection, we propose a clustering method based on one-pass dynamic programming and k-means clustering. For clustering bass-line patterns, a method based on k -means clustering capable of handling pitch-shifting is proposed. After extracting these two fundamental kinds of patterns for each style/genre, feature vectors which are suitable for representing information about the patterns are proposed for supervised learning. Experimental results show that the automatically calculated rhythmic pattern information and bass pattern information can be used to effectively classify musical genre/style and improve upon current approaches based on timbral features.
Keywords
dynamic programming; feature extraction; learning (artificial intelligence); music; musical acoustics; pattern clustering; audio music collections; automatically calculated rhythmic pattern information; bass lines; bass pattern information; clustering sequences; genre classification; k-means clustering; music classification; one-pass dynamic programming; percussive patterns; pitch-shifting; supervised learning; timbral statistics; $k$ -means clustering; Dynamic programming; feature extraction; musical genre classification; pattern clustering method;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2010.2073706
Filename
5565447
Link To Document