DocumentCode
1355925
Title
On similarity codes
Author
Yachkov, Arkadii G D ; Torney, David C.
Author_Institution
Dept. of Probability Theory, Moscow State Univ., Russia
Volume
46
Issue
4
fYear
2000
fDate
7/1/2000 12:00:00 AM
Firstpage
1558
Lastpage
1564
Abstract
We introduce a biologically motivated measure of sequence similarity for quaternary N-sequences, extending Hamming similarity. This measure is the sum over the length of the sequences of “alphabetic” similarities at all positions. Alphabetic similarities are defined, symmetrically, on the Cartesian square of the alphabet. These similarities equal zero whenever the two elements differ. In distinction to Hamming similarity, however, our alphabetic similarities take individual values whenever the two elements are identical. In this correspondence we derive lower and upper bounds on the rate of the corresponding quaternary nonlinear and linear codes called similarity codes and applied to DNA sequences
Keywords
DNA; fractals; linear codes; nonlinear codes; random codes; sequences; Cartesian square; DNA sequences; Hamming similarity; alphabetic similarities; biologically motivated measure; code rate; cross-similarity; lower bound; quaternary N-sequences; quaternary linear codes; quaternary nonlinear codes; random coding bound; self similarity; sequence length; sequence similarity; similarity codes; upper bound; Application software; Computer science; Cryptography; Error correction codes; Information theory; Libraries; Linear code; Web server;
fLanguage
English
Journal_Title
Information Theory, IEEE Transactions on
Publisher
ieee
ISSN
0018-9448
Type
jour
DOI
10.1109/18.850695
Filename
850695
Link To Document