• DocumentCode
    2010736
  • Title

    An audio retrieval method based on chromagram and distance metrics

  • Author

    Yu, Xiaoqing ; Zhang, Jing ; Liu, Junwei ; Wan, Wanggen ; Yang, Wei

  • Author_Institution
    Sch. of Commun. & Inf. Eng., Shanghai Univ., Shanghai, China
  • fYear
    2010
  • fDate
    23-25 Nov. 2010
  • Firstpage
    425
  • Lastpage
    428
  • Abstract
    In this paper, a content-based audio retrieval method is proposed, which can quickly detect and locate similar sound in audio database. We extract a chroma-based audio feature: chromagram, a variation on time-frequency distributions, which represents the spectral energy at each of 12 pitch classes. Compared with traditional feature MFCC (Mel Frequency Cesptral Coefficient), chromagram is better when using correlation distance as audio similarity measurement. Then we choose Jonathan Foote´s music retrieval database to do experiments and final results show that the retrieval accuracy can reach over 96.7% using chromagram as features even when the signal-to-noise ratio is 0 dB.
  • Keywords
    audio databases; audio signal processing; content-based retrieval; Jonathan Foote music retrieval database; Mel frequency cesptral coefficient; audio database; audio similarity measurement; chroma-based audio feature; chromagram; content-based audio retrieval; distance metrics; Accuracy; Correlation; Databases; Feature extraction; Measurement; Mel frequency cepstral coefficient; Signal to noise ratio;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio Language and Image Processing (ICALIP), 2010 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-5856-1
  • Type

    conf

  • DOI
    10.1109/ICALIP.2010.5684543
  • Filename
    5684543