DocumentCode :
3121063
Title :
Suffix trees for universal source modeling with applications
Author :
Gibble, Jay ; Paris, Bernd-Peter
Author_Institution :
Q.E.D. Inc., Reston, VA, USA
fYear :
2011
fDate :
23-25 March 2011
Firstpage :
1
Lastpage :
6
Abstract :
Suffix trees are shown to reveal significant information about the internal structure of an individual sequence S. Specifically it is shown how the number of occurrences of any subsequence of S, the recurrence period for any subsequence that occurs at least twice in S, and the longest subsequence that occurs at least twice in S are determined from the sequences suffix tree. Since suffix trees can be constructed with low, O(N) computational complexity, they provide a powerful tool for information-theoretic sequence analysis. We further demonstrate the utility of suffix trees by using the collected statistics for two common problems in information theory: variable order source modeling and entropy estimation.
Keywords :
Markov processes; computational complexity; entropy; matrix algebra; sequences; trees (mathematics); computational complexity; entropy estimation; information-theoretic sequence analysis; sequence suffix tree; universal source modeling; variable order Markov model; variable order source modeling; Computational modeling; Context; Entropy; Indexes; Markov processes; Pattern matching; Reduced order systems; Suffix tree; entropy estimation; finite state machine; variable order Markov model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Sciences and Systems (CISS), 2011 45th Annual Conference on
Conference_Location :
Baltimore, MD
Print_ISBN :
978-1-4244-9846-8
Electronic_ISBN :
978-1-4244-9847-5
Type :
conf
DOI :
10.1109/CISS.2011.5766226
Filename :
5766226
Link To Document :
بازگشت