• DocumentCode
    1245790
  • Title

    Voiced speech coding at very low bit rates based on forward-backward waveform prediction

  • Author

    Yang, Gao ; Leich, Henri ; Boite, Red

  • Author_Institution
    Lernout & Hanspie Speech Products N.V., Belgium
  • Volume
    3
  • Issue
    1
  • fYear
    1995
  • fDate
    1/1/1995 12:00:00 AM
  • Firstpage
    40
  • Lastpage
    47
  • Abstract
    Techniques for coding voiced speech at very low bit rates are investigated and a new algorithm, designed to produce high quality speech with low complexity, is proposed. This algorithm encodes and transmits partial representative waveforms (RWs) from which the complete speech waveforms are reconstructed by using a method called forward-backward waveform prediction (FBWP). The RW is encoded at 20-30 ms intervals with a low complexity approach, taking into account the special initial conditions of short- and long-term filters. The basic idea of FBWP is essentially consistent with that of the prototype waveform interpolation (PWI) algorithm, which was reported to be capable of producing high-quality voiced speech at a bit rate of between 3.0 and 4.0 kb/s. By implementing the FBWP in the time domain, fast computation is thereby made possible while high-quality speech can be obtained at bit rate of about 3 kb/s. As in the PWI method, the proposed algorithm may be combined with an LP-based speech coder which uses a noise-like excitation to reproduce unvoiced speech
  • Keywords
    filtering theory; linear predictive coding; speech coding; speech intelligibility; vocoders; waveform analysis; 3 to 4 kbit/s; LP-based speech coder; PWI method; algorithm; forward-backward waveform prediction; high quality speech; initial conditions; long-term filters; low complexity; noise-like excitation; prototype waveform interpolation; representative waveforms; short-term filters; speech waveforms; time domain; unvoiced speech; very low bit rates; voiced speech coding; Algorithm design and analysis; Bit rate; Distortion; Filters; Helium; Quantization; Reverberation; Signal generators; Speech coding; Vocoders;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.365382
  • Filename
    365382