DocumentCode :
2030974
Title :
Pattern Matching in Constrained Sequences
Author :
Yongwook Choi ; Szpankowski, W.
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN
fYear :
2007
fDate :
24-29 June 2007
Firstpage :
2606
Lastpage :
2610
Abstract :
Constrained sequences find applications in communication, magnetic recording, and biology. In this paper, we restrict our attention to the so-called (d, k) constrained binary sequences in which any run of zeros must be of length at least d and at most k, where 0lesd<k. In some applications one needs to know the number of occurrences of a given pattern w in such sequences, for which we coin the term constrained pattern matching. For a given word w or a set of words W, we estimate the (conditional) probability of the number of occurrences of w in a (d, k) sequence generated by a memoryless source. As a by-product, we enumerate asymptotically the number of (d, k) sequences with exactly r occurrences of a given word w, and compute Shannon entropy of (d, k) sequences with a given number of occurrences of w. Throughout this paper we use techniques of analytic information theory such as combinatorial calculus, generating functions, and complex asymptotics.
Keywords :
binary sequences; entropy; pattern matching; probability; Shannon entropy; combinatorial calculus; complex asymptotics; conditional probability; constrained binary sequences; constrained pattern matching; generating function; information theory; Application software; Binary sequences; Biodiversity; Biological system modeling; Computer science; Constraint theory; Digital communication; Neurons; Pattern analysis; Pattern matching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory, 2007. ISIT 2007. IEEE International Symposium on
Conference_Location :
Nice
Print_ISBN :
978-1-4244-1397-3
Type :
conf
DOI :
10.1109/ISIT.2007.4557611
Filename :
4557611
Link To Document :
بازگشت