Title :
Speech LSF quantization with rate independent complexity, bit scalability and learning
Author :
Subramaniam, Anand D. ; Rao, Bhaskar D.
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA, USA
Abstract :
A computationally efficient, high quality, vector quantization scheme based on a parametric probability density function (PDF) is proposed. In this scheme, speech line spectral frequencies (LSF) are modeled as i.i.d realizations of a multivariate Gaussian mixture density. The mixture model parameters are efficiently estimated using the expectation maximization (EM) algorithm. An efficient quantization scheme using transform coding and bit allocation techniques which allows for easy and computationally efficient mapping from observation to quantized value is developed for both fixed rate and variable rate systems. An attractive feature of this method is that source encoding using the resultant codebook involves very few searches and its computational complexity is minimal and independent of the rate of the system. Furthermore, the proposed scheme is bit scalable and can switch between memoryless and quantizer with memory seamlessly. The performance of the memoryless quantizer is 2-3 bits better than conventional quantization schemes
Keywords :
Gaussian processes; computational complexity; learning systems; memoryless systems; optimisation; probability; source coding; spectral analysis; speech coding; vector quantisation; EM algorithm; VQ; bit allocation; bit scalability; bit scalable scheme; codebook; computational complexity; computationally efficient mapping; computationally efficient vector quantization; expectation maximization; fixed rate systems; i.i.d multivariate Gaussian mixture density; learning; memory quantizer; memoryless quantizer; parameter estimation; parametric PDF; probability density function; rate independent complexity; source encoding; speech LSF quantization; speech line spectral frequencies; transform coding; variable rate systems; Bit rate; Computational complexity; Encoding; Frequency; Parameter estimation; Probability density function; Speech; Switches; Transform coding; Vector quantization;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941012