• DocumentCode
    542652
  • Title

    Psycho-acoustic modeling of audio with exponentially damped sinusoids

  • Author

    Hermus, Kris ; Verhelst, Werner ; Wambacq, Patrick

  • Author_Institution
    Lab. of Processing Speech and Images (PSI), Dept. of Electrical Engineering - ESAT, Katholieke Universiteit Leuven, Belgium
  • Volume
    2
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    While a traditional sinusoidal model is capable of representing audio segments, a sum of exponentially damped sinusoids is more efficient to model the transient segments that are readily found in audio signals. In this paper, Total Least Squares (TLS) algorithms are applied to automatically extract the modeling parameters in the Exponential Sinusoidal Model (ESM). In order to turn the SNR . optimization criterion of these TLS algorithms into a perceptual modeling strategy we incorporate the psycho-acoustic model of MPEG 1 - Layer 1 into a subband TLS-ESM scheme. This allows us to model each subband in accordance with its perceptual relevance. Informal listening tests confirm that perceptual ESM achieves the same perceived quality as plain ESM while using substantially less components.
  • Keywords
    Lead; Signal to noise ratio;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5744978
  • Filename
    5744978