• DocumentCode
    831945
  • Title

    Karhunen-Loeve method for data compression and speech synthesis

  • Author

    Chen, C.S. ; Huo, K.-S.

  • Author_Institution
    Dept. of Electr. Eng., Akron Univ., OH, USA
  • Volume
    138
  • Issue
    5
  • fYear
    1991
  • Firstpage
    377
  • Lastpage
    380
  • Abstract
    The use of the Karhunen-Loeve (KL) method in speech data compression and synthesis using the Fourier-Bessel (FB) expansion coefficients of speech signals is described. Bessel functions seem to make a natural basis for speech signal decomposition. Sinusoidal functions are the eigenfunctions of vibrating strings. Bessel functions are the eigenfunctions of vibrating pipes. The vocal tract resembles an excited pipe rather than a vibrating string. Good quality intelligible speech signals can be reconstructed using only a small portion of the FB expansion coefficient. Further data compression is possible through KL transformation of the speech signal FB expansion coefficient for efficient speech coding and synthesis. The transformation is implemented by first forming a covariance matrix of the FB coefficients. Eigenvalues and eigenvectors of the covariance matrix are computed and ranked according to the eigenvalue magnitude. Speech signals are then reconstructed using only the feature corresponding to the larger magnitude eigenvalues of the covariance matrix.<>
  • Keywords
    Bessel functions; data compression; eigenvalues and eigenfunctions; speech analysis and processing; speech synthesis; Fourier-Bessel expansion coefficient; Karhunen-Loeve method; covariance matrix; data compression; eigenfunctions; eigenvalues; eigenvectors; speech coding; speech synthesis; vibrating pipes; vocal tract;
  • fLanguage
    English
  • Journal_Title
    Communications, Speech and Vision, IEE Proceedings I
  • Publisher
    iet
  • ISSN
    0956-3776
  • Type

    jour

  • Filename
    103836