Title :
A two-layered trellis approach to audio encoding
Author :
Melkote, Vinay ; Rose, Kenneth
Author_Institution :
ECE Dept., Univ. of California, Santa Barbara, CA
Date :
March 31 - April 4, 2008
Abstract :
The fact that audio compression for streaming or storage is usually performed offline alleviates traditional constraints on encoding delay. We propose a rate-distortion optimized approach, within the MPEG Advanced Audio Coding framework, to trade delay for optimal window switching, resource allocation and selection of quantization and coding parameters for the entire audio file using a two-layered trellis. Stages of the outer trellis correspond to audio frames, nodes represent window choices, and branches implement transition constraints. The inner trellis operates within each node of the outer layer and has stages corresponding to scalefactor bands and nodes representing combinations of quantization and coding parameters. A suitable cost, comprising bit consumption and psychoacoustic distortion, is optimized via multiple passes through the two-layered trellis to achieve the desired bitrate. The procedure thus optimizes most of the encoding decisions involved in audio compression. Objective and subjective tests show considerable performance gains.
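The two-layered search described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a Lagrangian cost J = D + lambda * R, a toy side-information cost equal to the absolute difference between neighbouring parameter indices, and a bisection on lambda standing in for the "multiple passes". The names `viterbi`, `inner_cost`, `encode`, `total_bits`, `fit_lambda`, and all numbers in the toy data are hypothetical. The `ALLOWED` table follows the usual AAC window-sequence transitions.

```python
INF = float("inf")

# Window sequences and the successors assumed legal for each (outer branches).
ALLOWED = {
    "LONG": {"LONG", "START"},
    "START": {"SHORT", "STOP"},
    "SHORT": {"SHORT", "STOP"},
    "STOP": {"LONG", "START"},
}

def viterbi(stages, node_cost, branch_cost):
    """Return (min cost, path) through a trellis given as a list of node lists."""
    best = {n: (node_cost(0, n), [n]) for n in stages[0]}
    for t in range(1, len(stages)):
        nxt = {}
        for n in stages[t]:
            # Cheapest predecessor of node n, including the branch cost.
            c, p = min(
                ((pc + branch_cost(t, pn, n), pp) for pn, (pc, pp) in best.items()),
                key=lambda x: x[0],
            )
            nxt[n] = (c + node_cost(t, n), p + [n])
        best = nxt
    return min(best.values(), key=lambda x: x[0])

def inner_cost(options, lam):
    """Inner trellis: stages are bands, nodes are parameter choices.
    options[band][choice] = (bits, distortion)."""
    stages = [list(range(len(band))) for band in options]
    node = lambda b, k: options[b][k][1] + lam * options[b][k][0]
    branch = lambda b, prev, k: lam * abs(k - prev)  # toy side-info bits
    return viterbi(stages, node, branch)

def encode(frames, lam):
    """Outer trellis: stages are frames, nodes are window choices.
    frames[t][window] = options table for the inner trellis."""
    inner = [{w: inner_cost(opts, lam) for w, opts in f.items()} for f in frames]
    stages = [list(f) for f in frames]
    node = lambda t, w: inner[t][w][0]
    branch = lambda t, prev, w: 0.0 if w in ALLOWED[prev] else INF
    cost, windows = viterbi(stages, node, branch)
    return cost, windows, [inner[t][w][1] for t, w in enumerate(windows)]

def total_bits(frames, windows, params):
    """Bits spent by a solution under the toy rate model."""
    bits = 0
    for f, w, ks in zip(frames, windows, params):
        bits += sum(f[w][b][k][0] for b, k in enumerate(ks))
        bits += sum(abs(a - b) for a, b in zip(ks, ks[1:]))
    return bits

def fit_lambda(frames, budget, lo=0.0, hi=100.0, passes=20):
    """Bisect lambda; assumes the initial hi already meets the bit budget."""
    for _ in range(passes):
        lam = (lo + hi) / 2
        _, windows, params = encode(frames, lam)
        if total_bits(frames, windows, params) > budget:
            lo = lam  # over budget: penalise rate more
        else:
            hi = lam
    return hi, encode(frames, hi)

# Toy data: two bands per frame, two choices per band (fine vs. coarse).
long_opts = [[(8, 1.0), (2, 6.0)], [(8, 1.0), (2, 6.0)]]
short_opts = [[(10, 3.0), (4, 9.0)], [(10, 3.0), (4, 9.0)]]
frames = [{"LONG": long_opts, "SHORT": short_opts}] * 2

lam, (cost, windows, params) = fit_lambda(frames, budget=20)
```

Each outer node's cost is the optimum of its own inner trellis, so a single outer Viterbi pass selects windows and per-band parameters jointly for a given lambda. The bisection then repeats that pass until the bit budget is met.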
Keywords :
audio coding; audio signals; MPEG; audio compression; audio encoding; audio file; audio frame; encoding delay; optimal window switching; psychoacoustic distortion; quantization selection; resource allocation; two-layered trellis; Cost function; Delay; Encoding; Quantization; Rate-distortion; Resource management; Streaming media; Transform coding; AAC; trellis optimization; window switching
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517581