DocumentCode
430198
Title
Low complexity decomposition for the characteristic waveform of speech signal
Author
Wang, Guiping ; Bao, Changchun
Author_Institution
Speech & Audio Signal Process. Lab, Beijing Univ. of Technol., China
fYear
2004
fDate
15-18 Dec. 2004
Firstpage
145
Lastpage
148
Abstract
For efficient coding of speech, it is desirable to separate the slowly and rapidly evolving spectral components to take advantage of their different perceptual qualities. Existing decomposition methods are too inflexible to model transient changes in the speech signals, require high delay or produce a large parameter set that is not scalable to low rates. We present a low complexity decomposition method, based on SVD, applied to waveform interpolation (WI) coding. This scheme reduces the computational complexity of the common SVD method in WI by exploiting the properties of human auditory perception to lower the dimensions of the decomposition matrix. This method requires only a single frame of speech and overcomes the substantial delay problems. The quantization solution involves the use of vector quantization on the separately decomposed singular matrices, U, V, and the diagonal matrix of singular values, S. The quality of the reconstructed speech can be varied according to the scalable decomposition and the bit rate available.
Keywords
computational complexity; hearing; interpolation; signal reconstruction; singular value decomposition; speech coding; SVD; characteristic waveform; computational complexity; decomposition matrix; diagonal matrix; human auditory perception; singular matrices; spectral components separation; speech coding; speech reconstruction; vector quantization; waveform interpolation coding; Bit rate; Computational complexity; Delay; Finite impulse response filter; Humans; Matrix decomposition; Nonlinear filters; Quantization; Singular value decomposition; Speech coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN
0-7803-8678-7
Type
conf
DOI
10.1109/CHINSL.2004.1409607
Filename
1409607
Link To Document