DocumentCode :
1245280
Title :
Termination and continuity of greedy growing for tree-structured vector quantizers
Author :
Nobel, Andrew B. ; Olshen, Richard A.
Author_Institution :
Dept. of Stat., North Carolina Univ., Chapel Hill, NC, USA
Volume :
42
Issue :
1
fYear :
1996
fDate :
1/1/1996 12:00:00 AM
Firstpage :
191
Lastpage :
205
Abstract :
Tree-structured vector quantizers (TSVQ) provide a computationally efficient, variable-rate method of compressing vector-valued data. In applications, the problem of designing a TSVQ from empirical training data is critical. Greedy growing algorithms are a common and effective approach to the design problem. They are recursive procedures that produce a TSVQ one node at a time by optimizing a simple splitting criterion at each step. While unsupervised greedy growing algorithms are well-understood from an experimental point of view, there has been little theory to support their use, or to examine their behavior on large training sets. The authors present a rigorous analysis of a greedy growing algorithm proposed by Riskin (1990), Riskin and Gray (1991), and Balakrishnan (1991). The first part of the paper is a description of the algorithm and an examination of its asymptotic behavior as it applies to a fixed, absolutely continuous distribution. The second part of the paper establishes the structural consistency of the algorithm with respect to a convergent sequence of distributions. As an application, the authors obtain results concerning the large-sample empirical behavior of the algorithm when it is applied to stationary ergodic training vectors
Keywords :
image coding; tree data structures; variable rate codes; vector quantisation; asymptotic behavior; continuity; convergent sequence of distributions; empirical training data; fixed absolutely continuous distribution; greedy growing; large-sample empirical behavior; recursive procedures; splitting criterion; stationary ergodic training vectors; structural consistency; termination; tree-structured vector quantizers; variable-rate method; vector-valued data compression; Algorithm design and analysis; Binary trees; Biomedical imaging; Data compression; Distortion measurement; Greedy algorithms; Risk analysis; Speech recognition; Statistics; Training data;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/18.481789
Filename :
481789
Link To Document :
بازگشت