DocumentCode
23087
Title
An Efficient Cascaded Filtering Retrieval Method for Big Audio Data
Author
Shanshan Yao ; Yunsheng Wang ; Baoning Niu
Author_Institution
Dept. of Comput. Sci. & Technol., Taiyuan Univ. of Technol., Taiyuan, China
Volume
17
Issue
9
fYear
2015
fDate
Sept. 2015
Firstpage
1450
Lastpage
1459
Abstract
Fast audio retrieval is crucial for many important applications and yet demanding due to the high dimension nature and increasingly larger volume of audios on the Internet. Although audio fingerprinting can greatly reduce its dimension while keeping audio identifiable, the dimension for audio fingerprints is still too high to scale up for big audio data. The tradeoff between accuracy (measured by precision and recall rate) and efficiency (measured by retrieval time) prevents further reduction in the dimension of fingerprints. This paper shows that a multi-stage filtering strategy can achieve both speedup and high accuracy, with the beginning stages focusing on speedup and the end stage emphasizing accuracy. With this strategy, an efficient cascaded filtering retrieval method is proposed that consists of filtering with Fibonacci hashing, the middle fingerprint, thresholds to quickly select candidate audios, and refining with an accurate and robust fingerprint on the candidate audios. Experiments with 500 000 audios show that the proposed method can achieve a speed gain more than 28 K times that of the Fibonacci hashing retrieval. After applying MP3 conversion, resampling, white noise addition, and background noise addition, the recall rates of the method are all above 99.45%, and the precision is the same as the Philips audio fingerprint, which is close to 100%.
Keywords
Internet; audio signal processing; filtering theory; information retrieval; Fibonacci hashing retrieval; Internet; MP3 conversion; audio fingerprinting; big audio data; efficient cascaded filtering retrieval method; multistage filtering strategy; Accuracy; Algorithm design and analysis; Audio databases; Filtering; Fingerprint recognition; Robustness; Audio middle fingerprint; Philips audio fingerprint; big audio data; cascade filtering retrieval; content-based retrieval;
fLanguage
English
Journal_Title
Multimedia, IEEE Transactions on
Publisher
ieee
ISSN
1520-9210
Type
jour
DOI
10.1109/TMM.2015.2460121
Filename
7165676
Link To Document