Author :
György, András ; Linder, Tamás ; Lugosi, Gábor
Abstract :
In the framework of prediction of individual sequences, sequential prediction methods are to be constructed that perform nearly as well as the best expert from a given class. We consider prediction strategies that compete with the class of switching strategies that can segment a given sequence into several blocks, and follow the advice of a different “base” expert in each block. As usual, the performance of the algorithm is measured by the regret defined as the excess loss relative to the best switching strategy selected in hindsight for the particular sequence to be predicted. In this paper, we construct prediction strategies of low computational cost for the case where the set of base experts is large. In particular, we provide a method that can transform any prediction algorithm A that is designed for the base class into a tracking algorithm. The resulting tracking algorithm can take advantage of the prediction performance and potential computational efficiency of A in the sense that it can be implemented with time and space complexity only O(nγ lnn) times larger than that of A, where n is the time horizon and γ ≥ 0 is a parameter of the algorithm. With A properly chosen, our algorithm achieves a regret bound of optimal order for γ >; 0, and only O(lnn) times larger than the optimal order for γ = 0 for all typical regret bound types we examined. For example, for predicting binary sequences with switching parameters under the logarithmic loss, our method achieves the optimal O(lnn) regret rate with time complexity O(n1+γlnn) for any γ ∈ (0,1).
Keywords :
binary sequences; computational complexity; prediction theory; base expert; binary sequence; computational cost; logarithmic loss; sequential prediction method; space complexity; time complexity; tracking algorithm; Algorithm design and analysis; Complexity theory; Data compression; Heuristic algorithms; Prediction algorithms; Signal processing algorithms; Switches; Algorithmic efficiency; changing environments; compound experts; data compression; individual sequences; nonstationary sources; sequential coding; sequential decision making; sequential prediction;