• DocumentCode
    312013
  • Title

    An application of recurrent neural networks to low bit rate speech coding

  • Author

    Kohata, Minoru

  • Author_Institution
    Fac. of Eng., Tohoku Univ., Sendai, Japan
  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    314
  • Abstract
    It is well known that the LSP coefficient which represents the speech spectrum envelope as one of the linear prediction coefficients, shows good performance for spectral interpolation along the time axis, but it is also known that the duration of interpolation is limited up to 20~30 ms. This limitation makes it difficult to reduce the bit rate in very low bit rate speech coding. To resolve this problem, recurrent neural networks (RNN) were applied to interpolate LSP coefficients, and it was possible to increase the duration of interpolation to about 100 ms without so much degradation of the synthesized speech quality
  • Keywords
    interpolation; linear predictive coding; recurrent neural nets; speech coding; LSP coefficient; linear prediction coefficients; low bit rate speech coding; recurrent neural networks; spectral interpolation; speech spectrum envelope; synthesized speech quality; Bit rate; Degradation; Delay effects; Ear; Interpolation; Network synthesis; Neural networks; Recurrent neural networks; Speech coding; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607116
  • Filename
    607116