DocumentCode
542652
Title
Psycho-acoustic modeling of audio with exponentially damped sinusoids
Author
Hermus, Kris ; Verhelst, Werner ; Wambacq, Patrick
Author_Institution
Lab. of Processing Speech and Images (PSI), Dept. of Electrical Engineering - ESAT, Katholieke Universiteit Leuven, Belgium
Volume
2
fYear
2002
fDate
13-17 May 2002
Abstract
While a traditional sinusoidal model is capable of representing audio segments, a sum of exponentially damped sinusoids is more efficient to model the transient segments that are readily found in audio signals. In this paper, Total Least Squares (TLS) algorithms are applied to automatically extract the modeling parameters in the Exponential Sinusoidal Model (ESM). In order to turn the SNR . optimization criterion of these TLS algorithms into a perceptual modeling strategy we incorporate the psycho-acoustic model of MPEG 1 - Layer 1 into a subband TLS-ESM scheme. This allows us to model each subband in accordance with its perceptual relevance. Informal listening tests confirm that perceptual ESM achieves the same perceived quality as plain ESM while using substantially less components.
Keywords
Lead; Signal to noise ratio;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5744978
Filename
5744978
Link To Document