Title :
Decorrelating transforms for spectral vector quantization
Author :
Arjona Ramirez, Miguel
Author_Institution :
Dept. of Electron. Syst. Eng., Univ. of Sao Paulo, Sao Paulo, Brazil
Abstract :
Split vector quantization (SVQ) performs well and efficiently for line spectral frequency (LSF) quantization, but misses some component dependencies. Switched SVQ (SSVQ) can restore some advantage due to nonlinear dependencies through Gaussian Mixture Models (GMM). Remaining linear dependencies or correlations between vector components can be used to advantage by transform coding. The Karhunen-Loève transform (KLT) is normally used but eigendecomposition and full transform matrices make it computationally complex. However, a family of transforms has been recently characterized by the capability of generalized triangular decomposition (GTD) of the source covariance matrix. The prediction-based lower triangular transform (PLT) is the least complex of such transforms and is a component in the implementation of all of them. This paper proposes a minimum noise structure for PLT SVQ. Coding results for 16-dimensional LSF vectors from wideband speech show that GMM PLT SSVQ can achieve transparent quantization down to 41 bit/frame with distortion performance close to GMM KLT SSVQ at about three-fourths as much operational complexity. Other members of the GTD family, such as the geometric mean decomposition (GMD) transform and the bidiagonal (BID) transform, fail to capitalize on their advantageous features due to the low bit rate per component in the range tested.
Keywords :
Gaussian processes; Karhunen-Loeve transforms; computational complexity; covariance matrices; decorrelation; eigenvalues and eigenfunctions; prediction theory; spectral analysis; transform coding; vector quantisation; BID transform; GMD transform; GMM PLT SSVQ; GTD family; Gaussian mixture models; KLT; Karhunen-Loève transform; LSF quantization; LSF vectors; PLT SVQ; bidiagonal transform; component dependency; computational complexity; decorrelating transforms; distortion performance; eigendecomposition; generalized triangular decomposition; geometric mean decomposition transform; line spectral frequency quantization; minimum noise structure; nonlinear dependency; operational complexity; prediction-based lower triangular transform; source covariance matrix; spectral vector quantization; split vector quantization; switched SVQ; transform coding; transform matrices; transparent quantization; vector components; wideband speech; Bit rate; Covariance matrices; Matrix decomposition; Quantization (signal); Switches; Transforms; Vectors; Gaussian mixture models; Karhunen-Loève transform; generalized triangular decomposition; line spectral frequencies; linear prediction; prediction-based lower triangular transform; switched split vector quantization;
Conference_Titel :
Digital Signal Processing (DSP), 2013 18th International Conference on
Conference_Location :
Fira
DOI :
10.1109/ICDSP.2013.6622682