DocumentCode
831945
Title
Karhunen-Loeve method for data compression and speech synthesis
Author
Chen, C.S. ; Huo, K.-S.
Author_Institution
Dept. of Electr. Eng., Akron Univ., OH, USA
Volume
138
Issue
5
fYear
1991
Firstpage
377
Lastpage
380
Abstract
The use of the Karhunen-Loeve (KL) method in speech data compression and synthesis using the Fourier-Bessel (FB) expansion coefficients of speech signals is described. Bessel functions seem to make a natural basis for speech signal decomposition. Sinusoidal functions are the eigenfunctions of vibrating strings. Bessel functions are the eigenfunctions of vibrating pipes. The vocal tract resembles an excited pipe rather than a vibrating string. Good quality intelligible speech signals can be reconstructed using only a small portion of the FB expansion coefficient. Further data compression is possible through KL transformation of the speech signal FB expansion coefficient for efficient speech coding and synthesis. The transformation is implemented by first forming a covariance matrix of the FB coefficients. Eigenvalues and eigenvectors of the covariance matrix are computed and ranked according to the eigenvalue magnitude. Speech signals are then reconstructed using only the feature corresponding to the larger magnitude eigenvalues of the covariance matrix.<>
Keywords
Bessel functions; data compression; eigenvalues and eigenfunctions; speech analysis and processing; speech synthesis; Fourier-Bessel expansion coefficient; Karhunen-Loeve method; covariance matrix; data compression; eigenfunctions; eigenvalues; eigenvectors; speech coding; speech synthesis; vibrating pipes; vocal tract;
fLanguage
English
Journal_Title
Communications, Speech and Vision, IEE Proceedings I
Publisher
iet
ISSN
0956-3776
Type
jour
Filename
103836
Link To Document