• DocumentCode
    1879451
  • Title

    Automatic music genre classification using timbral texture and rhythmic content features

  • Author

    Baniya, Babu Kaji ; Ghimire, Deepak ; Joonwhoan Lee

  • Author_Institution
    Div. of Comput. Sci. & Eng., Chonbuk Nat. Univ., Jeonju, South Korea
  • fYear
    2015
  • fDate
    1-3 July 2015
  • Firstpage
    434
  • Lastpage
    443
  • Abstract
    Music genre classification is a vital component for the music information retrieval system. There are two important components to be considered for better genre classification, which are audio feature extraction and classifier. This paper incorporates two different kinds of features for genre classification, timbral texture and rhythmic content features. Timbral texture contains the Mel-frequency Cepstral Coefficient (MFCC) with other several spectral features. Before choosing a timbral feature we explore which feature contributes a less significant role on genre discrimination. This facilitates the reduction of feature dimension. For the timbral features up to the 4-th order central moments and the covariance components of mutual features are considered to improve the overall classification result. For the rhythmic content the features extracted from beat histogram are selected. In the paper Extreme Learning Machine (ELM) with bagging is used as the classifier for classifying the genres. Based on the proposed feature sets and classifier, experiments are performed with two well-known datasets: GTZAN and the ISMIR2004 databases with ten and six different music genres, respectively. The proposed method acquires better and competitive classification accuracy compared to the existing approaches for both data sets.
  • Keywords
    audio signal processing; feature extraction; information retrieval systems; learning (artificial intelligence); music; signal classification; ELM with bagging; GTZAN database; ISMIR2004 database; MFCC; Mel-frequency cepstral coefficient; audio feature classifier; audio feature extraction; classification accuracy; extreme learning machine; feature dimension reduction; genre discrimination; music genre classification; music information retrieval system; rhythmic content feature; timbral texture; Bagging; Feature extraction; Histograms; Mel frequency cepstral coefficient; Speech; Standards; Classification; ELM (Extreme Learning Machine) with bagging; covariance matrix; music genres; rhythmic contents; timbral texture;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Communication Technology (ICACT), 2015 17th International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-8-9968-6504-9
  • Type

    conf

  • DOI
    10.1109/ICACT.2015.7224907
  • Filename
    7224907