• DocumentCode
    2159971
  • Title

    A Study of Multi-dimensional Melodic Similarity Model Based on Perceptual Analysis

  • Author

    Xu, Jieping ; Zhao, Yang ; Liu, Yi

  • Volume
    5
  • fYear
    2008
  • fDate
    27-30 May 2008
  • Firstpage
    78
  • Lastpage
    82
  • Abstract
    Perceived predominant melody of music is the most convenient and memorable description and can be used for content-based music retrieval. However, including human voice and multiple musical instruments playing together, it is difficult to extract a predominant contour of pitches directly from MP3 music recordings. In order to build a similarity music melody model, several audio files, whose melody is perceptually similar, are collected in our experiment. Many features are extracted and a melodic similarity model is defined through analyzing each feature and combinations of them. The melody model is evaluated based on classification results of six categories of Chinese folk music using Support Vector Machine. The experiment results show that 36-dimensional Constant Q Transform (CQT) feature can represent the melody of audio music pieces accurately. Further more, the classification results for audio data with similar melody are good enough to be used in audio music classification or segmentation and subsequently are very helpful in music information retrieval system.
  • Keywords
    Content based retrieval; Data mining; Digital audio players; Human voice; Instruments; Internet; Multiple signal classification; Music information retrieval; Support vector machine classification; Support vector machines; Constant Q Transform (CQT); Perception Analysis; Pitch Class Profile (PCP); Similarity Matrix;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing, 2008. CISP '08. Congress on
  • Conference_Location
    Sanya, China
  • Print_ISBN
    978-0-7695-3119-9
  • Type

    conf

  • DOI
    10.1109/CISP.2008.554
  • Filename
    4566790