Title :
An application of universal data compression to statistical analysis of time series
Author_Institution :
Inst. of Comput. Technol., Siberian State Univ. of Telecommun. & Inf., Russia
Abstract :
It is shown that universal data compressors (or universal codes) can be used for solving some of the most important statistical problems for time series. By definition, a universal lossless data compressor (or a universal code) can compress any sequence generated by a stationary and ergodic source asymptotically to the Shannon entropy, which, in turn, is the best achievable ratio for lossless data compressors. We consider finite-alphabet and real-valued time series and the following problems: estimation of the limiting probabilities for finite-alphabet time series and estimation of the density for real-valued time series and the on-line prediction for both types of the time series and the following problems of hypothesis testing: goodness-of-fit testing and testing of serial independence. On the one hand, all problems are considered in the framework of classical mathematical statistics and, on the other hand, everyday methods of data compression (or archivers) can be used as a tool for the estimation and testing. It turns out, that quite often the suggested methods and tests are more powerful than known ones when they are applied in practice.
Keywords :
data compression; statistical analysis; time series; Shannon entropy; finite-alphabet time series; goodness-of-fit testing; lossless data compressor; real-valued time series; sequence compression; serial independence testing; statistical analysis; time series online prediction; universal codes; universal data compression; Compressors; Data compression; Decoding; Entropy; Informatics; NIST; Random number generation; Statistical analysis; Telecommunication computing; Testing;
Conference_Titel :
Information Theory (ITW 2010, Cairo), 2010 IEEE Information Theory Workshop on
Conference_Location :
Cairo
Print_ISBN :
978-1-4244-6372-5
DOI :
10.1109/ITWKSPS.2010.5503156