• DocumentCode
    3421220
  • Title

    Perceptually-weighted audio coding that scales to extremely low bitrates

  • Author

    Kandadai, Srivatsan ; Creusere, Charles D.

  • Author_Institution
    Klipsch Sch. of Electr. & Comput. Eng., New Mexico State Univ., Las Cruces, NM, USA
  • fYear
    2006
  • fDate
    28-30 March 2006
  • Firstpage
    382
  • Lastpage
    391
  • Abstract
    A perceptually scalable audio coder generates a bit-stream that contains layers of audio fidelity and is encoded in such a way that adding one of these layers enhances the reconstructed audio by an amount that is just noticeable to the listener. Such algorithms have applications such as music on demand at variable levels of fidelity for 3G and 4G cellular radio systems operating at different bit rates. While the MPEG-4 natural audio coder can create scalable bit-streams, its perceptual quality at low bit rates is poor. On the other hand, the non-scalable TWIN-VQ performs well at low bit rates. In this paper we present a technique to modify the TWIN-VQ algorithm such that it generates a perceptually scalable bit-stream with layers of audio fidelity. Using the TWIN-VQ as our base ensures the best possible perceptual quality at low bit rates (8-16 kbps).
  • Keywords
    audio coding; vector quantisation; 3G cellular radio systems; 4G cellular radio systems; 8 to 16 kbit/s; MPEG-4 natural audio coder; TWIN-VQ algorithm; audio fidelity; perceptual quality; perceptually-weighted audio coding; scalable bit streams; Audio coding; Bit rate; Channel capacity; Communication channels; Digital communication; Land mobile radio cellular systems; MPEG 4 Standard; Psychology; Scalability; Streaming media
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Compression Conference, 2006. DCC 2006. Proceedings
  • ISSN
    1068-0314
  • Print_ISBN
    0-7695-2545-8
  • Type

    conf

  • DOI
    10.1109/DCC.2006.65
  • Filename
    1607273