• DocumentCode
    1351840
  • Title

    Discovering Time-Constrained Sequential Patterns for Music Genre Classification

  • Author

    Ren, Jia-Min ; Jang, Jyh-Shing Roger

  • Author_Institution
    Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
  • Volume
    20
  • Issue
    4
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    1134
  • Lastpage
    1144
  • Abstract
    A music piece can be considered as a sequence of sound events which represent both short-term and long-term temporal information. However, in the task of automatic music genre classification, most of text-categorization-based approaches could only capture temporal local dependencies (e.g., unigram and bigram-based occurrence statistics) to represent music contents. In this paper, we propose the use of time-constrained sequential patterns (TSPs) as effective features for music genre classification. First of all, an automatic language identification technique is performed to tokenize each music piece into a sequence of hidden Markov model indices. Then TSP mining is applied to discover genre-specific TSPs, followed by the computation of occurrence frequencies of TSPs in each music piece. Finally, support vector machine classifiers are employed based on these occurrence frequencies to perform the classification task. Experiments conducted on two widely used datasets for music genre classification, GTZAN and ISMIR2004Genre, show that the proposed method can discover more discriminative temporal structures and achieve a better recognition accuracy than the unigram and bigram-based statistical approach.
  • Keywords
    data mining; feature extraction; hidden Markov models; music; natural languages; pattern classification; text analysis; GTZAN; ISMIR2004Genre; automatic language identification technique; automatic music genre classification; bigram-based statistical approach; discriminative temporal structure; genre-specific TSP mining; hidden Markov model index; music content; music piece; sound events sequence; support vector machine classifier; temporal information; temporal local dependency; text-categorization-based approach; time-constrained sequential pattern discovery; Computer science; Data mining; Educational institutions; Feature extraction; Hidden Markov models; Music; Data mining; hidden Markov model (HMM); music genre classification; time-constrained sequential pattern (TSP);
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2172426
  • Filename
    6047569