Karhunen-Loeve method for data compression and speech synthesis

Author

Chen, C.S. ; Huo, K.-S.

Author_Institution

Dept. of Electr. Eng., Akron Univ., OH, USA

Volume

138

Issue

5

fYear

1991

Firstpage

377

Lastpage

380

Abstract

The use of the Karhunen-Loeve (KL) method in speech data compression and synthesis using the Fourier-Bessel (FB) expansion coefficients of speech signals is described. Bessel functions seem to make a natural basis for speech signal decomposition. Sinusoidal functions are the eigenfunctions of vibrating strings. Bessel functions are the eigenfunctions of vibrating pipes. The vocal tract resembles an excited pipe rather than a vibrating string. Good quality intelligible speech signals can be reconstructed using only a small portion of the FB expansion coefficients. Further data compression is possible through KL transformation of the speech signal FB expansion coefficients for efficient speech coding and synthesis. The transformation is implemented by first forming a covariance matrix of the FB coefficients. Eigenvalues and eigenvectors of the covariance matrix are computed and ranked according to the eigenvalue magnitude. Speech signals are then reconstructed using only the features corresponding to the larger magnitude eigenvalues of the covariance matrix.
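
The KL step outlined in the abstract (covariance matrix, eigendecomposition, ranking by eigenvalue magnitude, reconstruction from the leading features) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `kl_compress` and `kl_reconstruct`, the frame-by-coefficient array layout, and the synthetic input standing in for real FB coefficients are all assumptions made for illustration.

```python
import numpy as np


def kl_compress(coeffs, k):
    """Project coefficient vectors onto the k leading KL basis vectors.

    coeffs: array of shape (n_frames, n_coeffs); each row is assumed to hold
            the FB expansion coefficients of one speech frame (hypothetical layout).
    k:      number of features (eigenvectors) to keep.
    """
    mean = coeffs.mean(axis=0)
    centered = coeffs - mean
    # Covariance matrix of the coefficients (columns are the variables).
    cov = np.cov(centered, rowvar=False)
    # eigh is used because a covariance matrix is symmetric.
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Rank eigenpairs by eigenvalue magnitude, largest first.
    order = np.argsort(np.abs(eigvals))[::-1]
    ranked = eigvals[order]
    basis = eigvecs[:, order[:k]]      # shape (n_coeffs, k)
    features = centered @ basis        # shape (n_frames, k)
    return mean, basis, features, ranked


def kl_reconstruct(mean, basis, features):
    """Rebuild approximate coefficient vectors from the retained features."""
    return features @ basis.T + mean


# Synthetic stand-in for FB coefficients: 200 frames of 16 coefficients
# that mostly lie in a 3-dimensional subspace, plus a little noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 3))
mixing = rng.normal(size=(3, 16))
coeffs = latent @ mixing + 0.01 * rng.normal(size=(200, 16))

mean, basis, features, ranked = kl_compress(coeffs, k=3)
approx = kl_reconstruct(mean, basis, features)
rel_err = np.linalg.norm(coeffs - approx) / np.linalg.norm(coeffs)
print("kept 3 of 16 features, relative reconstruction error:", rel_err)
```

In this sketch each frame is stored as `k` features instead of `n_coeffs` coefficients, with the mean vector and basis shared across frames. How many features are needed for intelligible speech is a property of the real data and is not something this toy example shows.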

Keywords

Bessel functions; data compression; eigenvalues and eigenfunctions; speech analysis and processing; speech synthesis; Fourier-Bessel expansion coefficient; Karhunen-Loeve method; covariance matrix; eigenfunctions; eigenvalues; eigenvectors; speech coding; vibrating pipes; vocal tract

fLanguage

English

Journal_Title

Communications, Speech and Vision, IEE Proceedings I

Publisher

IET

ISSN

0956-3776

Type

jour

Filename

103836