DocumentCode :
3438358
Title :
Large alphabets: Finite, infinite, and scaling models
Author :
Ohannessian, Mesrob I. ; Dahleh, Munther A.
Author_Institution :
Lab. for Inf. & Decision Syst., Massachusetts Inst. of Technol., Cambridge, MA, USA
fYear :
2012
fDate :
21-23 March 2012
Firstpage :
1
Lastpage :
5
Abstract :
How can we effectively model situations with large alphabets? On a pragmatic level, any engineered system, be it for inference, communication, or encryption, requires working with a finite number of symbols. Therefore, the most straight-forward model is a finite alphabet. However, to emphasize the disproportionate size of the alphabet, one may want to compare its finite size with the length of data at hand. More generally, this gives rise to scaling models that strive to capture regimes of operation where one anticipates such imbalance. Large alphabets may also be idealized as infinite. The caveat then is that such generality strips away many of the convenient machinery of finite settings. However, some of it may be salvaged by refocusing the tasks of interest, such as by moving from sequence to pattern compression, or by minimally restricting the classes of infinite models, such as via tail properties. In this paper we present an overview of models for large alphabets, some recent results, and possible directions in this area.
Keywords :
computational linguistics; convenient machinery; engineered system; finite models; finite settings; infinite models; large alphabets; scaling models; Convergence; Entropy; Equations; Estimation; Information theory; Mathematical model; Redundancy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Sciences and Systems (CISS), 2012 46th Annual Conference on
Conference_Location :
Princeton, NJ
Print_ISBN :
978-1-4673-3139-5
Electronic_ISBN :
978-1-4673-3138-8
Type :
conf
DOI :
10.1109/CISS.2012.6310941
Filename :
6310941
Link To Document :
بازگشت