DocumentCode
2935104
Title
Perceptually-Driven Scalable MDCT Enhancement of Compressed Audio Based on Statistical Conversion
Author
Cantzos, Demetrios ; Mouchtaris, Athanasios ; Kyriakakis, Chris
Author_Institution
Dept. of Autom. Eng., Technol. Educ. Inst. of Piraeus (TEI Piraeus), Athens, Greece
fYear
2011
fDate
5-7 Dec. 2011
Firstpage
41
Lastpage
46
Abstract
Many state-of-the-art audio codecs operating in a transform domain provide scalability as a core function by allowing to selectively subtract bits -- usually according to a nonperceptual criterion from the full bit rate data stream. This work presents a different, or even reverse, scalability approach in which a scalable codec can selectively add perceptually significant bits to a low bit rate data stream. The scalable enhancement algorithm presented here operates in the Modified Discrete Cosine Transform domain, which is popular among perceptual audio transform encoders, but its extension on other domains is straightforward. By exploiting the information of an existing low bit rate base layer, the algorithm adds perceptually significant data to the data stream according to a psycho acoustic model, and improves the audio quality at a fraction of the bit rate that would normally be required for the encoding or transmission of the whole audio piece of the same quality. Applications of this can be found in packet retransmission schemes of compressed audio networks and in remote audio enhancement.
Keywords
audio coding; discrete cosine transforms; statistical analysis; audio codecs; audio transform encoders; compressed audio networks; full bit rate data stream; modified discrete cosine transform domain; nonperceptual criterion; perceptually-driven scalable MDCT enhancement; psycho acoustic model; remote audio enhancement; scalability approach; scalable codec; scalable enhancement algorithm; statistical conversion; Bit rate; Decoding; Nuclear magnetic resonance; Psychoacoustic models; Scalability; Sorting; Vectors; Audio coding; MDCT; conversion; enhancement; psychoacoustic model; scalability;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia (ISM), 2011 IEEE International Symposium on
Conference_Location
Dana Point CA
Print_ISBN
978-1-4577-2015-4
Type
conf
DOI
10.1109/ISM.2011.16
Filename
6123323
Link To Document