Title :
Quantifying perceptual distortion in scalably compressed MPEG audio
Author :
Creusere, Charles D.
Author_Institution :
Klipsch Sch. of Electr. & Comput. Eng., New Mexico State Univ., Las Cruces, NM, USA
Abstract :
A scalably compressed bitstream is one which can be streamed and decoded at a wide variety of bitrates, and it is therefore compatible with communications channels of varying capacity. The audio coding portions of the MPEG 2 and 4 standards support fine-grained scalability through the use of bit slice arithmetic coding (BSAC). Human subjective analysis of BSAC, however, has shown that it performs poorly at low bitrates; seemingly random tonal patterns are superimposed on the actual audio. Here, we develop a new approach for objectively characterizing such distortion and validate it with human subjective trials. Unlike most other objective performance metrics, the proposed approach does not require sample-accurate sequence synchronization. As a comparison, we also apply the ITU-R BS.1387-1 objective testing recommendation to the same audio sequences and quantify how well it predicts the observed subjective quality.
Keywords :
arithmetic codes; audio coding; channel capacity; data compression; decoding; distortion; MPEG audio; audio coding; audio sequences; bit slice arithmetic coding; channels capacity; energy equalization; perceptual distortion; scalably compressed bitstream; tonal patterns; Arithmetic; Audio coding; Bit rate; Channel capacity; Communication channels; Decoding; Humans; Scalability; Streaming media; Transform coding;
Conference_Titel :
Signals, Systems and Computers, 2004. Conference Record of the Thirty-Seventh Asilomar Conference on
Print_ISBN :
0-7803-8104-1
DOI :
10.1109/ACSSC.2003.1291911