DocumentCode :
3387374
Title :
Decorrelating transforms for spectral vector quantization
Author :
Arjona Ramirez, Miguel
Author_Institution :
Dept. of Electron. Syst. Eng., Univ. of Sao Paulo, Sao Paulo, Brazil
fYear :
2013
fDate :
1-3 July 2013
Firstpage :
1
Lastpage :
6
Abstract :
Split vector quantization (SVQ) performs well and efficiently for line spectral frequency (LSF) quantization, but misses some component dependencies. Switched SVQ (SSVQ) can restore some advantage due to nonlinear dependencies through Gaussian Mixture Models (GMM). Remaining linear dependencies or correlations between vector components can be used to advantage by transform coding. The Karhunen-Loève transform (KLT) is normally used but eigendecomposition and full transform matrices make it computationally complex. However, a family of transforms has been recently characterized by the capability of generalized triangular decomposition (GTD) of the source covariance matrix. The prediction-based lower triangular transform (PLT) is the least complex of such transforms and is a component in the implementation of all of them. This paper proposes a minimum noise structure for PLT SVQ. Coding results for 16-dimensional LSF vectors from wideband speech show that GMM PLT SSVQ can achieve transparent quantization down to 41 bit/frame with distortion performance close to GMM KLT SSVQ at about three-fourths as much operational complexity. Other members of the GTD family, such as the geometric mean decomposition (GMD) transform and the bidiagonal (BID) transform, fail to capitalize on their advantageous features due to the low bit rate per component in the range tested.
Keywords :
Gaussian processes; Karhunen-Loeve transforms; computational complexity; covariance matrices; decorrelation; eigenvalues and eigenfunctions; prediction theory; spectral analysis; transform coding; vector quantisation; BID transform; GMD transform; GMM PLT SSVQ; GTD family; Gaussian mixture models; KLT; Karhunen-Loève transform; LSF quantization; LSF vectors; PLT SVQ; bidiagonal transform; component dependency; computational complexity; decorrelating transforms; distortion performance; eigendecomposition; generalized triangular decomposition; geometric mean decomposition transform; line spectral frequency quantization; minimum noise structure; nonlinear dependency; operational complexity; prediction-based lower triangular transform; source covariance matrix; spectral vector quantization; split vector quantization; switched SVQ; transform coding; transform matrices; transparent quantization; vector components; wideband speech; Bit rate; Covariance matrices; Matrix decomposition; Quantization (signal); Switches; Transforms; Vectors; Gaussian mixture models; Karhunen-Loève transform; generalized triangular decomposition; line spectral frequencies; linear prediction; prediction-based lower triangular transform; switched split vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Signal Processing (DSP), 2013 18th International Conference on
Conference_Location :
Fira
ISSN :
1546-1874
Type :
conf
DOI :
10.1109/ICDSP.2013.6622682
Filename :
6622682
Link To Document :
بازگشت