• DocumentCode
    1184217
  • Title

    Fully vector-quantized neural network-based code-excited nonlinear predictive speech coding

  • Author

    Wu, Lizhong ; Niranjan, Mahesan ; Fallside, Frank

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Oregon Graduate Inst., Portland, OR, USA
  • Volume
    2
  • Issue
    4
  • fYear
    1994
  • fDate
    10/1/1994 12:00:00 AM
  • Firstpage
    482
  • Lastpage
    489
  • Abstract
    Recent studies have shown that nonlinear predictors can achieve about 2-3 dB improvement in speech prediction over conventional linear predictors. In this paper, we exploit the advantage of the nonlinear prediction capability of neural networks and apply it to the design of improved predictive speech coders. Our studies concentrate on the following three aspects: (a) the development of short-term (formant) and long-term (pitch) nonlinear predictive vector quantizers (b) the analysis of the output variance of the nonlinear predictive filter with respect to the input disturbance (c) the design of nonlinear predictive speech coders. The above studies have resulted in a fully vector-quantized, code-excited, nonlinear predictive speech coder. Performance evaluations and comparisons with linear predictive speech coding are presented. These tests have shown the applicability of nonlinear prediction in speech coding and the improvement in coding performance
  • Keywords
    linear predictive coding; neural nets; speech coding; vector quantisation; code-excited nonlinear predictive speech coding; coding performance; formant VQ; input disturbance; long-term VQ; neural networks; nonlinear predictive filter; output variance; performance evaluations; pitch VQ; predictive speech coders; short-term VQ; speech prediction; vector quantizers; Analysis of variance; Equations; Filters; Neural networks; Predictive models; Recurrent neural networks; Speech analysis; Speech coding; Testing; Vectors;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.326608
  • Filename
    326608