DocumentCode
489
Title
Parametric Audio Coding With Exponentially Damped Sinusoids
Author
Derrien, Olivier ; Badeau, Roland ; Richard, Guilhem
Author_Institution
Lab. de Mec. et d´Acoust., Marseille, France
Volume
21
Issue
7
fYear
2013
fDate
Jul-13
Firstpage
1489
Lastpage
1501
Abstract
Sinusoidal modeling is one of the most popular techniques for low bitrate audio coding. Usually, the sinusoidal parameters (amplitude, pulsation and phase of each sinusoidal component) are kept constant within a time segment. An alternative model, the so-called Exponentially-Damped Sinusoidal (EDS) model, includes an additional damping parameter for each sinusoidal component to better represent the signal characteristics. It was however never shown that the EDS model could be efficient for perceptual audio coding. To that aim, we propose in this paper an efficient analysis/synthesis framework with dynamic time-segmentation on transients and psychoacoustic modeling, and an asymptotically optimal entropy-constrained quantization method for the four sinusoid parameters (e.g., including damping). We then apply this coding technique to real audio excerpts for a given entropy target corresponding to a low bitrate (20 kbits/s), and compare this method with a classical sinusoidal coding scheme using a constant-amplitude sinusoidal model and the perceptually weighted Matching Pursuit algorithm. Subjective listening tests show that the EDS model is more efficient on audio samples with fast transient content, and similar to the classical model for more stationary audio samples.
Keywords
acoustic signal processing; audio coding; damping; entropy; iterative methods; quantisation (signal); signal representation; transient analysis; EDS modelling; asymptotically optimal entropy constrained quantization method; constant amplitude sinusoidal model; damping parameter; dynamic time segmentation; entropy target; exponentially damped sinusoidal; parametric audio coding; perceptual audio coding; perceptually weighted matching pursuit algorithm; psychoacoustic modeling; signal representation; stationary audio sample; transient content; transient modeling; Exponentially damped sinusoids; entropy; parametric audio coding; quantization;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2013.2255284
Filename
6490016
Link To Document