Title :
Audio Fingerprinting: Combining Computer Vision & Data Stream Processing
Author :
Baluja, Shumeet ; Covell, Michele
Author_Institution :
Google Inc., Mountain View, CA
Abstract :
In this paper, we present Waveprint, a novel system for audio identification. Waveprint uses a combination of computer-vision techniques and large-scale data-stream processing algorithms to create compact fingerprints of audio data that can be efficiently matched. The resulting system has excellent identification capabilities for small snippets of audio that have been degraded in a variety of ways, including competing noise, poor recording quality, and cell-phone playback. We measure the tradeoffs between performance, memory usage, and computation through extensive experimentation. Compared with previous state-of-the-art systems, the system is more accurate while using less memory and computation.
Keywords :
audio signal processing; computer vision; audio fingerprinting; cell-phone playback; compact fingerprints; competing noise; data stream processing; memory usage; waveprint; Acoustic noise; Computer vision; Degradation; Fingerprint recognition; Frequency; Image retrieval; Large-scale systems; Signal processing algorithms; Spectrogram; Streaming media; Acoustic Applications; Acoustic Signal Processing; Music; Pattern Recognition
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2007.366210