Psycho-acoustic modeling of audio with exponentially damped sinusoids

Author

Hermus, Kris ; Verhelst, Werner ; Wambacq, Patrick

Author_Institution

Lab. of Processing Speech and Images (PSI), Dept. of Electrical Engineering - ESAT, Katholieke Universiteit Leuven, Belgium

Volume

fYear

2002

fDate

13-17 May 2002

Abstract

While a traditional sinusoidal model is capable of representing audio segments, a sum of exponentially damped sinusoids is more efficient to model the transient segments that are readily found in audio signals. In this paper, Total Least Squares (TLS) algorithms are applied to automatically extract the modeling parameters in the Exponential Sinusoidal Model (ESM). In order to turn the SNR . optimization criterion of these TLS algorithms into a perceptual modeling strategy we incorporate the psycho-acoustic model of MPEG 1 - Layer 1 into a subband TLS-ESM scheme. This allows us to model each subband in accordance with its perceptual relevance. Informal listening tests confirm that perceptual ESM achieves the same perceived quality as plain ESM while using substantially less components.

Keywords

Lead; Signal to noise ratio;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location

Orlando, FL, USA

ISSN

1520-6149

Print_ISBN

0-7803-7402-9

Type

conf

DOI

10.1109/ICASSP.2002.5744978

Filename

5744978

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=542652