• DocumentCode
    2991804
  • Title

    The waveform segment vocoder: A new approach for very-low-rate speech coding

  • Author

    Roucos, Salim ; Wilgus, Alexander M.

  • Author_Institution
    Bolt Beranek and Newman Inc., Cambridge, MA
  • Volume
    10
  • fYear
    1985
  • fDate
    31138
  • Firstpage
    236
  • Lastpage
    239
  • Abstract
    We propose a new method of synthesis to be used for the segment vocoder, which transmits intelligible speech at rates below 300 b/s. The earlier segment vocoder applies LPC analysis to input speech, divides it into segments of variable duration, matches each segment with the nearest template from a codebook, concatenates at the receiver the set of nearest templates, and finally synthesizes the resultant sequence of speech frames using LPC synthesis. The quality of such a segment vocoder cannot exceed that of a standard unquantized LPC vocoder, which sounds buzzy due to the pulse/noise excitation used. Alternatively, by beginning with the waveforms (not the spectral representation) corresponding to the set of nearest templates, we can independently modify the pitch, energy, and duration of each template to match those of the input segment. These modified segments are then concatenated to produce the output waveform. We present here methods for high-quality modification of the pitch and duration of a segment of a speech waveform and show how these methods can be applied to improve the quality of the segment vocoder´s output speech.
  • Keywords
    Acoustic noise; Concatenated codes; Fasteners; Linear predictive coding; Speech analysis; Speech coding; Speech enhancement; Speech synthesis; Splicing; Vocoders;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '85.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1985.1168222
  • Filename
    1168222