مرکز منطقه ای اطلاع رساني علوم و فناوري - High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models

DocumentCode :

1118057

Title :

High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models

Author :

Duni, Ethan R. ; Rao, Bhaskar D.

Author_Institution :

Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA

Volume :

Issue :

fYear :

2007

fDate :

3/1/2007 12:00:00 AM

Firstpage :

756

Lastpage :

769

Abstract :

This paper examines the design of recursive vector quantization systems built around Gaussian mixture vector quantizers. The problem of designing such systems for minimum high-rate distortion, under input-weighted squared error, is discussed. It is shown that, in high dimensions, the design problem becomes equivalent to a weighted maximum likelihood problem. A variety of recursive coding schemes, based on hidden Markov models are presented. The proposed systems are applied to the problem of wideband speech line spectral frequency (LSF) quantization under the log spectral distortion (LSD) measure. By combining recursive quantization and random coding techniques, the systems are able to attain transparent quality at rates as low as 36 bits per frame

Keywords :

Gaussian processes; hidden Markov models; maximum likelihood estimation; random codes; speech coding; vector quantisation; Gaussian mixture vector quantizers; hidden Markov models; high-rate optimized recursive vector quantization structures; input-weighted squared error; log spectral distortion measure; minimum high-rate distortion; random coding technique; recursive coding schemes; weighted maximum likelihood problem; wideband speech line spectral frequency quantization; Distortion measurement; Frequency measurement; Hidden Markov models; Image coding; Maximum likelihood estimation; Nonlinear distortion; Parameter estimation; Speech; Vector quantization; Wideband; Hidden Markov model (HMM); high-rate quantization; log spectral distortion (LSD); parameter estimation; wideband speech;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2006.885903

Filename :

4100675

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1118057