DocumentCode
397320
Title
Universal compression for I.I.D. sources with large alphabets
Author
Shamir, Gil I.
Author_Institution
Dept. of Electr. & Comput. Eng., Utah Univ., Salt Lake City, UT, USA
fYear
2003
fDate
29 June-4 July 2003
Firstpage
24
Abstract
The minimum description length (MDL) principle is derived for universal compression of i.i.d. sources with large alphabets of size k that may be up to sub-linear with the data sequence length n. Each unknown source probability parameter is shown to cost 0.5log(n/k) bits. This result is shown to be a lower bound in the average minimax sense, and also for most sources in the class. The bound is shown to be achievable even sequentially with the well-known Krichevsky-Trofimov low-complexity scheme.
Keywords
minimax techniques; probability; source coding; Krichevsky-Trofimov low-complexity scheme; data sequence length; large alphabets; minimax sense; minimum description length; source probability; universal compression; Channel capacity; Cities and towns; Costs; Entropy; Error probability; Gas insulated transmission lines; Helium; Minimax techniques; Random sequences; Redundancy;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Theory, 2003. Proceedings. IEEE International Symposium on
Print_ISBN
0-7803-7728-1
Type
conf
DOI
10.1109/ISIT.2003.1228038
Filename
1228038
Link To Document