• DocumentCode
    3570171
  • Title

    Multi-label annotation of music

  • Author

    Ahsan, Hiba ; Kumar, Vijay ; Jawahar, C.V.

  • Author_Institution
    Nat. Inst. of Technol. Karnataka, Mangalore, India
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Automatic annotation of an audio or a music piece with multiple labels helps in understanding the composition of a music. Such meta-level information can be very useful in applications such as music transcription, retrieval, organization and personalization. In this work, we formulate the problem of annotation as multi-label classification which is considerably different from that of a popular single (binary or multi-class) label classification. We employ both the nearest neighbour and max-margin (SVM) formulations for the automatic annotation. We consider K-NN and SVM that are adapted for multi-label classification using one-vs-rest strategy and a direct multi-label classification formulation using ML-KNN and M3L. In the case of music, often the signatures of the labels (e.g. instruments and vocal signatures) are fused in the features. We therefore propose a simple feature augmentation technique based on non-negative matrix factorization (NMF) with an intuition to decompose a music piece into its constituent components. We conducted our experiments on two data sets - Indian classical instruments dataset and Emotions dataset [1], and validate the methods.
  • Keywords
    matrix decomposition; multimedia computing; music; pattern classification; support vector machines; Emotions dataset; Indian classical instruments; M3L; ML-KNN; NMF; SVM; automated multilabel music annotation; direct multilabel classification formulation; feature augmentation technique; max-margin formulation; meta level information; nearest neighbour formulation; non-negative matrix factorization; one-vs-rest strategy; Context; Feature extraction; Instruments; Matrix decomposition; Music; Support vector machines; Training; Music annotation; multi-label classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on
  • Type

    conf

  • DOI
    10.1109/ICAPR.2015.7050685
  • Filename
    7050685