• DocumentCode
    1065822
  • Title

    Sequencing-by-hybridization revisited: the analog-spectrum proposal

  • Author

    Preparata, Franco P.

  • Author_Institution
    Dept. of Comput. Sci., Brown Univ., Providence, RI, USA
  • Volume
    1
  • Issue
    1
  • fYear
    2004
  • Firstpage
    46
  • Lastpage
    52
  • Abstract
    All published approaches to DNA sequencing by hybridization (SBH) consist of the biochemical acquisition of the spectrum of a target sequence (the set of its subsequences conforming to a given probing pattern) followed by the algorithmic reconstruction of the sequence from its spectrum. In the "standard" or "uniform" approach, the probing pattern is a string of length L and the length of reliably reconstructible sequences is known to be mlen = O(2L). For a fixed microarray area, higher sequencing performance can be achieved by inserting nonprobing gaps ("wild-cards") in the probing pattern. The reconstruction, however, must cope with the emergence of fooling probes due to the gaps and algorithmic failure occurs when the spectrum becomes too densely populated, although we can achieve mcomp = O(4L). Despite the combinatorial success of gapped probing, all current approaches are based on a biochemically unrealistic spectrum-acquisition model (digital-spectrum). The reality of hybridization is much more complex. Departing from the conventional model, in this paper, we propose an alternative, called the analog-spectrum model, which more closely reflects the biochemical process. This novel modeling reestablishes probe length as the performance-governing factor, adopting "semidegenerate bases" as suitable emulators of currently inadequate universal bases. One important conclusion is that accurate biochemical measurements are pivotal to the success of SBH. The theoretical proposal presented in this paper should be a convincing stimulus for the needed biotechnological work.
  • Keywords
    DNA; biochemistry; biological techniques; molecular biophysics; physiological models; DNA sequencing; analog-spectrum model; biochemical acquisition; biochemically unrealistic spectrum-acquisition model; fooling probes; gapped probing; hybridization; probe length; probing pattern; DNA; Labeling; Laboratories; Libraries; Optical wavelength conversion; Probes; Proposals; Sequences; Thermodynamics; Visualization; Index Terms- DNA sequencing; analog spectrum; gapped probes; microarrays; semidegenerate bases.; sequencing-by-hybridization; thermodynamics of hybridization; Algorithms; Computational Biology; DNA; Nucleic Acid Hybridization; Oligonucleotide Array Sequence Analysis; Sequence Analysis, DNA; Thermodynamics; Transition Temperature;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2004.12
  • Filename
    1324619