DocumentCode
2527625
Title
Minimal marker sets to discriminate among seedlines
Author
Hudson, Thomas C. ; Stapleton, Ann E. ; Curley, Amy M.
Author_Institution
Dept. of Comput. Sci. & Biol. Sci., North Carolina Univ., Wilmington, NC, USA
fYear
2005
fDate
8-11 Aug. 2005
Firstpage
120
Lastpage
121
Abstract
Raising seeds for biological experiments is prone to error; a careful experimenter will test in the lab to verify that plants are of the intended strain. Choosing a minimal set of tests that will discriminate between all known seedlines is an instance of Minimal Test Set, a NP-complete problem. Similar biological problems, such as minimizing the number of haplotype tag SNPs, require complex nondeterministic heuristics to solve in reasonable timeframes over modest datasets. However, selecting the minimal marker set to discriminate among seedlines is less complicated than other problems considered in the literature; we show that a simple heuristic approach works well in practice. Finding all minimal sets of tests to identify 91 Zea mays recombinant inbred lines would require months of CPU time; our heuristic gives a result less than twice the minimal possible size in under five seconds, with similar performance on Arabidopsis thaliana recombinant inbred lines.
Keywords
agriculture; biology computing; genetics; molecular biophysics; CPU time Arabidopsis thaliana; haplotype tag SNPs; intended strain; minimal marker sets; nondeterministic heuristics; raising seeds; recombinant inbred lines; Bioinformatics; Biology; Capacitive sensors; Computer errors; Computer science; NP-complete problem; Open source software; Plants (biology); Sampling methods; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Systems Bioinformatics Conference, 2005. Workshops and Poster Abstracts. IEEE
Print_ISBN
0-7695-2442-7
Type
conf
DOI
10.1109/CSBW.2005.92
Filename
1540566
Link To Document