DocumentCode :
3558655
Title :
Union of MDCT Bases for Audio Coding
Author :
Ravelli, Emmanuel ; Richard, Gal ; Daudet, Laurent
Author_Institution :
Inst. Jean le Rond d´´Alembert-LAM, Univ. Pierre et Marie Curie-Paris, Paris
Volume :
16
Issue :
8
fYear :
2008
Firstpage :
1361
Lastpage :
1372
Abstract :
This paper investigates the use of sparse overcomplete decompositions for audio coding. Audio signals are decomposed over a redundant union of modified discrete cosine transform (MDCT) bases having eight different scales. This approach produces a sparser decomposition than the traditional MDCT-based orthogonal transform and allows better coding efficiency at low bitrates. Contrary to state-of-the-art low bitrate coders, which are based on pure parametric or hybrid representations, our approach is able to provide transparency. Moreover, we use a bitplane encoding approach, which provides a fine-grain scalable coder that can seamlessly operate from very low bitrates up to transparency. Objective evaluation, as well as listening tests, show that the performance of our coder is significantly better than a state-of-the-art transform coder at very low bitrates and has similar performance at high bitrates. We provide a link to test soundfiles and source code to allow better evaluation and reproducibility of the results.
Keywords :
audio coding; discrete cosine transforms; reliability; source coding; audio coding; bitplane encoding approach; coding efficiency; fine-grain scalable coder; modified discrete cosine transform; objective evaluation; orthogonal transform; source code; sparse overcomplete decompositions; state-of-the-art low bit rate coders; Acoustic testing; Audio coding; Bit rate; Discrete cosine transforms; Discrete transforms; MPEG 4 Standard; Matching pursuit algorithms; Parametric statistics; Reproducibility of results; Signal representations; Audio coding; matching pursuit; scalable coding; signal representations; sparse representations;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.2004290
Filename :
4648210
Link To Document :
بازگشت