• DocumentCode
    982185
  • Title

    A Mid-Level Representation for Melody-Based Retrieval in Audio Collections

  • Author

    Marolt, Matija

  • Author_Institution
    Fac. of Comput. & Inf. Sci., Univ. of Ljubljana, Ljubljana
  • Volume
    10
  • Issue
    8
  • fYear
    2008
  • Firstpage
    1617
  • Lastpage
    1625
  • Abstract
    Searching audio collections using high-level musical descriptors is a difficult problem, due to the lack of reliable methods for extracting melody, harmony, rhythm, and other such descriptors from unstructured audio signals. In this paper, we present a novel approach to melody-based retrieval in audio collections. Our approach supports audio, as well as symbolic queries and ranks results according to melodic similarity to the query. We introduce a beat-synchronous melodic representation consisting of salient melodic lines, which are extracted from the analyzed audio signal. We propose the use of a 2D shift-invariant transform to extract shift-invariant melodic fragments from the melodic representation and demonstrate how such fragments can be indexed and stored in a song database. An efficient search algorithm based on locality-sensitive hashing is used to perform retrieval according to similarity of melodic fragments. On the cover song detection task, good results are achieved for audio, as well as for symbolic queries, while fast retrieval performance makes the proposed system suitable for retrieval in large databases.
  • Keywords
    audio databases; feature extraction; music; query processing; transforms; very large databases; audio collection; beat-synchronous melodic representation; large database; locality-sensitive hashing; melodic similarity; melody extraction; melody-based retrieval; musical descriptor; search algorithm; shift-invariant transform; song database; song detection; symbolic query; Audio collections; information retrieval; melody; music;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2008.2007293
  • Filename
    4668511