• DocumentCode
    409619
  • Title

    Quantifying perceptual distortion in scalably compressed MPEG audio

  • Author

    Creusere, Charles D.

  • Author_Institution
    Klipsch Sch. of Electr. & Comput. Eng., New Mexico State Univ., Las Cruces, NM, USA
  • Volume
    1
  • fYear
    2003
  • fDate
    9-12 Nov. 2003
  • Firstpage
    265
  • Abstract
    A scalably compressed bitstream is one which can be streamed and decoded at a wide variety of bitrates, and it is therefore compatible with communications channels of varying capacity. The audio coding portions of the MPEG 2 and 4 standards support fine-grained scalability through the use of bit slice arithmetic coding (BSAC). Human subjective analysis of BSAC, however, has shown that it performs poorly at low bitrates; seemingly random tonal patterns are superimposed on the actual audio. Here, we develop a new approach for objectively characterizing such distortion and validate it with human subjective trials. Unlike most other objective performance metrics, the proposed approach does not require sample-accurate sequence synchronization. As a comparison, we also apply the ITU-R BS.1387-1 objective testing recommendation to the same audio sequences and quantify how well it predicts the observed subjective quality.
  • Keywords
    arithmetic codes; audio coding; channel capacity; data compression; decoding; distortion; MPEG audio; audio coding; audio sequences; bit slice arithmetic coding; channels capacity; energy equalization; perceptual distortion; scalably compressed bitstream; tonal patterns; Arithmetic; Audio coding; Bit rate; Channel capacity; Communication channels; Decoding; Humans; Scalability; Streaming media; Transform coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers, 2004. Conference Record of the Thirty-Seventh Asilomar Conference on
  • Print_ISBN
    0-7803-8104-1
  • Type

    conf

  • DOI
    10.1109/ACSSC.2003.1291911
  • Filename
    1291911