• DocumentCode
    1314993
  • Title

    Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines

  • Author

    Tsunoo, Emiru ; Tzanetakis, George ; Ono, Nobutaka ; Sagayama, Shigeki

  • Author_Institution
    Grad. Sch. of Infor mation Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
  • Volume
    19
  • Issue
    4
  • fYear
    2011
  • fDate
    5/1/2011 12:00:00 AM
  • Firstpage
    1003
  • Lastpage
    1014
  • Abstract
    This paper discusses a new approach for clustering sequences of bar-long percussive and bass-line patterns in audio music collections and its application to genre classification. Many musical genres and styles are characterized by two kinds of distinct representative patterns, i.e., percussive patterns and bass-line patterns. So far, in most automatic genre classification systems, rhythmic and bass melody information has not been effectively used. In order to extract bar-long unit rhythmic patterns for a music collection, we propose a clustering method based on one-pass dynamic programming and k-means clustering. For clustering bass-line patterns, a method based on k -means clustering capable of handling pitch-shifting is proposed. After extracting these two fundamental kinds of patterns for each style/genre, feature vectors which are suitable for representing information about the patterns are proposed for supervised learning. Experimental results show that the automatically calculated rhythmic pattern information and bass pattern information can be used to effectively classify musical genre/style and improve upon current approaches based on timbral features.
  • Keywords
    dynamic programming; feature extraction; learning (artificial intelligence); music; musical acoustics; pattern clustering; audio music collections; automatically calculated rhythmic pattern information; bass lines; bass pattern information; clustering sequences; genre classification; k-means clustering; music classification; one-pass dynamic programming; percussive patterns; pitch-shifting; supervised learning; timbral statistics; $k$-means clustering; Dynamic programming; feature extraction; musical genre classification; pattern clustering method;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2073706
  • Filename
    5565447