DocumentCode
1010500
Title
Finite-memory universal prediction of individual sequences
Author
Meron, Eado ; Feder, Meir
Author_Institution
Dept. of Electr. Eng.-Syst., Tel-Aviv Univ., Ramat-Aviv, Israel
Volume
50
Issue
7
fYear
2004
fDate
7/1/2004 12:00:00 AM
Firstpage
1506
Lastpage
1523
Abstract
The problem of predicting the next outcome of an individual binary sequence under the constraint that the universal predictor has a finite memory, is explored. In this analysis, the finite-memory universal predictors are either deterministic or random time-invariant finite-state (FS) machines with K states (K-state machines). The paper provides bounds on the asymptotic achievable regret of these constrained universal predictors as a function of K, the number of their states, for long enough sequences. The specific results are as follows. When the universal predictors are deterministic machines, the comparison class consists of constant predictors, and prediction is with respect to the 0-1 loss function (Hamming distance), we get tight bounds indicating that the optimal asymptotic regret is 1/(2K). In that case of K-state deterministic universal predictors, the constant predictors comparison class, but prediction is with respect to the self-information (code length) and the square-error loss functions, we show an upper bound on the regret (coding redundancy) of O(K-23/) and a lower bound of Θ(K-45/). For these loss functions, if the predictor is allowed to be a random K-state machine, i.e., a machine with random state transitions, we get a lower bound of Θ(1/K) on the regret, with a matching upper bound of O(1/K) for the square-error loss, and an upper bound of O(logK/K) Throughout the paper for the self-information loss. In addition, we provide results for all these loss functions in the case where the comparison class consists of all predictors that are order-L Markov machines.
Keywords
Markov processes; binary sequences; finite state machines; prediction theory; Markov machines; exponentially decaying memory; finite-memory universal prediction; imaginary sliding window; individual binary sequences; lower bound; random state transitions; saturated counter; self-information loss; square-error loss functions; time-invariant finite-state machines; universal coding; upper bound; Binary sequences; Convergence; Counting circuits; Data compression; Hamming distance; Information theory; Prediction theory; Signal processing; Upper bound;
fLanguage
English
Journal_Title
Information Theory, IEEE Transactions on
Publisher
ieee
ISSN
0018-9448
Type
jour
DOI
10.1109/TIT.2004.830749
Filename
1306548
Link To Document