• DocumentCode
    2398382
  • Title

    Nonlinear long-term prediction of speech signals

  • Author

    Birgmeier, Martin ; Bernhard, Hans-Peter ; Kubin, Gernot

  • Author_Institution
    Inst. of Commun. & Radio-Frequency Eng., Vienna Univ. of Technol., Austria
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1283
  • Abstract
    This paper presents an in-depth study of nonlinear long-term prediction of speech signals. While previous studies of nonlinear prediction focused on short-term prediction (with only moderate performance advantage over adaptive linear prediction in most cases), successful long-term prediction strongly depends on the nonlinear oscillator framework for speech modeling. This hypothesis has been confirmed in a series of experiments run on a voiced speech database. We provide results for the prediction gain as a function of the prediction delay using two methods. One is based on an extended form of radial basis function networks and is intended to show what performance can be reached using a nonlinear predictor. The other relies on calculating the mutual information between multiple signal samples. We explain the role of this mutual information function as the upper bound on the achievable prediction gain. We show that with matching memory and dimension, the two methods yield nearly the same value for the achievable prediction gain. We try to make a fair comparison of these values against those obtained using optimized linear predictors of various orders. It turns out that the nonlinear predictor´s gain is significantly higher than that for a linear predictor using the same parameters
  • Keywords
    delays; feedforward neural nets; prediction theory; signal sampling; speech processing; adaptive linear prediction; experiments; linear predictor gain; matching memory; multiple signal samples; mutual information function; nonlinear long term prediction; nonlinear oscillator; nonlinear predictor gain; optimized linear predictors; prediction delay; radial basis function networks; speech modeling; speech signals; upper bound; voiced speech database; Databases; Delay; Mutual information; Oscillators; Predictive models; Radial basis function networks; Radio frequency; Signal sampling; Speech processing; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596180
  • Filename
    596180