DocumentCode
2010736
Title
An audio retrieval method based on chromagram and distance metrics
Author
Yu, Xiaoqing ; Zhang, Jing ; Liu, Junwei ; Wan, Wanggen ; Yang, Wei
Author_Institution
Sch. of Commun. & Inf. Eng., Shanghai Univ., Shanghai, China
fYear
2010
fDate
23-25 Nov. 2010
Firstpage
425
Lastpage
428
Abstract
In this paper, a content-based audio retrieval method is proposed, which can quickly detect and locate similar sound in audio database. We extract a chroma-based audio feature: chromagram, a variation on time-frequency distributions, which represents the spectral energy at each of 12 pitch classes. Compared with traditional feature MFCC (Mel Frequency Cesptral Coefficient), chromagram is better when using correlation distance as audio similarity measurement. Then we choose Jonathan Foote´s music retrieval database to do experiments and final results show that the retrieval accuracy can reach over 96.7% using chromagram as features even when the signal-to-noise ratio is 0 dB.
Keywords
audio databases; audio signal processing; content-based retrieval; Jonathan Foote music retrieval database; Mel frequency cesptral coefficient; audio database; audio similarity measurement; chroma-based audio feature; chromagram; content-based audio retrieval; distance metrics; Accuracy; Correlation; Databases; Feature extraction; Measurement; Mel frequency cepstral coefficient; Signal to noise ratio;
fLanguage
English
Publisher
ieee
Conference_Titel
Audio Language and Image Processing (ICALIP), 2010 International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-5856-1
Type
conf
DOI
10.1109/ICALIP.2010.5684543
Filename
5684543
Link To Document