• DocumentCode
    1414385
  • Title

    Incorporating Cultural Representations of Features Into Audio Music Similarity Estimation

  • Author

    West, Kris ; Cox, Stephen

  • Author_Institution
    Sch. of Comput. Sci., Univ. of East Anglia, Norwich, UK
  • Volume
    18
  • Issue
    3
  • fYear
    2010
  • fDate
    3/1/2010 12:00:00 AM
  • Firstpage
    625
  • Lastpage
    637
  • Abstract
    We address the problem of estimating automatically from audio signals the similarity between two pieces of music, a technology that has many applications in the online digital music industry. Conventional methods of audio music search use distance measures between features derived from the audio for this task. We describe three techniques that make use of music classifiers to derive representations of audio features that are based on culturally motivated information learned by the classifier. When these representations are used for similarity estimation, they produce very significant reductions in computational complexity over existing techniques (such as those based on the KL-divergence), and also produce metric similarity spaces, which facilitate the use of technologies for the sub-linear scaling of search times. We have evaluated each system using both pseudo-objective techniques and human listeners, and we demonstrate that this efficiency gain is obtained while providing a comparable level of performance when compared with existing techniques.
  • Keywords
    audio signal processing; information retrieval; music; audio music search; audio music similarity estimation; cultural representation; metric similarity space; music classifier; music information retrieval; pseudoobjective technique; sublinear scaling; Collaboration; Computational complexity; Cultural differences; Databases; Filtering; Humans; Multiple signal classification; Music information retrieval; Performance gain; Space technology; Music information retrieval (MIR); music semantics; music similarity estimation;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2009.2025533
  • Filename
    5410074