• DocumentCode
    2736099
  • Title

    The role of error-coding in shaping the nucleotide alphabet: nature´s choice of A,U, C and G

  • Author

    Dónaill, D. A Mac

  • Author_Institution
    Dept. of Chem., Trinity Coll., Dublin, Ireland
  • Volume
    4
  • fYear
    2003
  • fDate
    17-21 Sept. 2003
  • Firstpage
    3850
  • Abstract
    The paper summarizes a proposed role for error-coding in shaping the composition of the natural nucleotide alphabet. It is argued that as nucleotide replication is essentially an information transmission phenomenon, the evolutionary pressures shaping alphabet composition might not have been confined to physicochemical issues alone, and that error-coding might have had a constraining evolutionary role. Nucleotide recognition features are mapped to 4-digit binary numbers, capturing the hydrogen donor/acceptor patterns (3-bits) and the purine/pyrimidine size motif (1-bit). An error-coding analysis, treating potential alphabets as combinations of 4-bit codewords, suggests that optimal alphabets are composed of nucleotides in which the purine/pyrimidine feature relates to the donor/acceptor pattern as a parity bit. Thus, of sixteen possible informationally distinct nucleotides, a viable alphabet, subject to error-coding constraints, may draw from either one of two sets of eight same-parity nucleotides, but not from both. Chemical considerations such as tautomeric instability, and vulnerability to hydrolysis, render several potential nucleotides unsuitable, reducing the size of the available sets. Two solutions emerge; one corresponds precisely to the natural alphabet of A, C, G and T. A second, less favored solution also appears viable, offering the possibility of life based on an alternative alphabet.
  • Keywords
    DNA; biological techniques; coding errors; evolution (biological); genetic engineering; hydrogen; isomerism; parity check codes; replica techniques; error-coding theory; hydrogen donor-acceptor patterns; hydrolysis; nucleotide alphabet composition shaping; nucleotide recognition; nucleotide replication; parity bit; purine-pyrimidine size; tautomeric instability; Chemicals; Chemistry; Computer errors; Educational institutions; Error analysis; Error correction; Genetics; Hydrogen; Pattern analysis; Pattern recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society, 2003. Proceedings of the 25th Annual International Conference of the IEEE
  • ISSN
    1094-687X
  • Print_ISBN
    0-7803-7789-3
  • Type

    conf

  • DOI
    10.1109/IEMBS.2003.1281003
  • Filename
    1281003