• DocumentCode
    1224319
  • Title

    Bandwidth Extension for Hierarchical Speech and Audio Coding in ITU-T Rec. G.729.1

  • Author

    Geiser, Bernd ; Jax, Peter ; Vary, Peter ; Taddei, Hervé ; Schandl, Stefan ; Gartner, Martin ; Guillaumé, Cyril ; Ragot, Stéphane

  • Author_Institution
    Inst. of Commun. Syst. & Data Process. (IND), RWTH Aachen Univ., Aachen
  • Volume
    15
  • Issue
    8
  • fYear
    2007
  • Firstpage
    2496
  • Lastpage
    2509
  • Abstract
    Recommendation G.729.1 is a new ITU-T standard which was approved in May 2006. This recommendation describes a hierarchical speech and audio coding algorithm built on top of a narrowband core codec. One challenge in the codec design is the generation of a wideband signal with a very limited additional bit rate (less than 2 kb/s). In this paper, we describe the respective codec layer, which extends the transmitted acoustic bandwidth from the narrowband frequency range (50 Hz-4 kHz) to the wideband frequency range (50 Hz-7 kHz). The underlying algorithm uses a fairly coarse parametric description of the temporal and spectral energy envelopes of the high frequency band (4-7 kHz). This parameter set is quantized with a bit rate of 1.65 kb/s. At the decoder side, the high-frequency components are regenerated by appropriately shaping a synthetically generated "excitation signal." Apart from the algorithmic description and a discussion, we state a complexity evaluation as well as some listening test results.
  • Keywords
    acoustic signal processing; audio coding; quantisation (signal); spectral analysis; speech codecs; speech coding; ITU-T Rec. G.729.1 standard; acoustic bandwidth extension; audio coding; hierarchical speech coding; narrowband core codec; parameter set quantization; temporal-spectral energy envelope; Audio coding; Bandwidth; Bit rate; Frequency; Narrowband; Signal design; Signal generators; Speech codecs; Speech coding; Wideband; Bandwidth extension; hierarchical bitstream organization; wideband speech coding;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.907330
  • Filename
    4317562