Regional Information Center for Science and Technology (RICeST) - Distortion Estimation in Compressed Music Using Only Audio Fingerprints

DocumentCode :

1037608

Title :

Distortion Estimation in Compressed Music Using Only Audio Fingerprints

Author :

Doets, Peter Jan O.; Lagendijk, Reginald L.

Author_Institution :

Fac. of Electr. Eng., Math., & Comput. Sci., Delft Univ. of Technol., Delft

Volume :

Issue :

fYear :

2008

Firstpage :

302

Lastpage :

317

Abstract :

An audio fingerprint is a compact yet very robust representation of the perceptually relevant parts of an audio signal. It can be used for content-based audio identification, even when the audio is severely distorted. Audio compression changes the fingerprint slightly. We show that these small fingerprint differences due to compression can be used to estimate the signal-to-noise ratio (SNR) of the compressed audio file compared to the original. This provides a useful content-based distortion estimate when the original, uncompressed audio file is unavailable. The method uses the audio fingerprints only. For stochastic signals distorted by additive noise, an analytical expression is obtained for the average fingerprint difference as a function of the SNR level. This model is based on an analysis of the Philips robust hash (PRH) algorithm. We show that for uncorrelated signals, the bit error rate (BER) is approximately inversely proportional to the square root of the SNR of the signal. This model is extended to correlated signals and music. For an experimental verification of our proposed model, we divide the field of audio fingerprinting algorithms into three categories. From each category, we select an algorithm that is representative of that category. Experiments show that the behavior predicted by the stochastic model for the PRH also holds for the two other algorithms.
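
The relationship between fingerprint bit errors and SNR described in the abstract can be explored numerically. The following is a minimal sketch, not the authors' implementation. It assumes a simplified PRH-style bit derivation (the sign of the band-energy difference taken first along frequency and then along time) and uses white Gaussian noise as both signal and distortion. The sampling rate, frame length, hop size, band edges, and all function names (`prh_like_bits`, `add_noise`, `bit_error_rate`, `ber_vs_snr`) are illustrative assumptions and are not taken from this record.

```python
import numpy as np


def prh_like_bits(x, frame_len=2048, hop=64, n_bands=33, fs=5000.0):
    """Simplified PRH-style fingerprint bits (all parameters are assumptions)."""
    window = np.hanning(frame_len)
    # Log-spaced band edges between 300 Hz and 2000 Hz (assumed range).
    edges = np.geomspace(300.0, 2000.0, n_bands + 1)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    masks = [(freqs >= lo) & (freqs < hi)
             for lo, hi in zip(edges[:-1], edges[1:])]

    n_frames = 1 + (len(x) - frame_len) // hop
    energies = np.empty((n_frames, n_bands))
    for n in range(n_frames):
        frame = x[n * hop: n * hop + frame_len] * window
        power = np.abs(np.fft.rfft(frame)) ** 2
        energies[n] = [power[mask].sum() for mask in masks]

    # Difference between adjacent bands, then between adjacent frames.
    d_freq = energies[:, :-1] - energies[:, 1:]
    d_time = d_freq[1:] - d_freq[:-1]
    # One bit per (frame, band pair): the sign of the double difference.
    return d_time > 0


def add_noise(x, snr_db, rng):
    """Return x plus white Gaussian noise scaled to the requested SNR in dB."""
    noise = rng.standard_normal(len(x))
    target_noise_power = np.mean(x ** 2) / 10 ** (snr_db / 10)
    return x + noise * np.sqrt(target_noise_power / np.mean(noise ** 2))


def bit_error_rate(bits_a, bits_b):
    """Fraction of fingerprint bits that differ."""
    return float(np.mean(bits_a != bits_b))


def ber_vs_snr(snr_dbs, n_samples=25000, seed=0):
    """BER between the fingerprints of a white signal and its noisy versions."""
    rng = np.random.default_rng(seed)
    clean = rng.standard_normal(n_samples)
    ref_bits = prh_like_bits(clean)
    return {
        snr_db: bit_error_rate(ref_bits,
                               prh_like_bits(add_noise(clean, snr_db, rng)))
        for snr_db in snr_dbs
    }


if __name__ == "__main__":
    for snr_db, ber in ber_vs_snr([0, 10, 20, 30, 40]).items():
        # If BER scales as 1/sqrt(SNR), this product stays roughly constant.
        product = ber * np.sqrt(10 ** (snr_db / 10))
        print(f"SNR {snr_db:>2} dB  BER {ber:.4f}  BER*sqrt(SNR) {product:.3f}")
```

Running the script prints the BER at several SNR levels together with the product of BER and the square root of the linear SNR. Under the proportionality stated in the abstract, that product would be expected to stay roughly constant at higher SNR levels, although a short test signal like this one gives only a noisy estimate.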

Keywords :

audio coding; cryptography; data compression; error statistics; stochastic processes; BER; Philips robust hash algorithm; audio compression; audio fingerprints; audio signal representation; average fingerprint difference; bit error rate; content-based audio identification; content-based distortion estimate; music compression; signal-to-noise ratio; stochastic signal distortion; Additive noise; Algorithm design and analysis; Audio compression; Bit error rate; Distortion; Fingerprint recognition; Robustness; Signal analysis; Signal to noise ratio; Stochastic resonance; Audio fingerprinting; content-based identification; quality estimation; reduced-reference quality estimation; signal-to-noise ratio (SNR) estimation; stochastic model;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2007.911716

Filename :

4432636

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1037608