DocumentCode :
1784726
Title :
Efficient and accurate clustering for large-scale genetic mapping
Author :
Strnadova, Veronika ; Buluc, Aydin ; Chapman, Jarrod ; Gilbert, John R. ; Gonzalez, Jose ; Jegelka, Stefanie ; Rokhsar, Daniel ; Oliker, Leonid
Author_Institution :
Comput. Sci. Dept., Univ. of California, Santa Barbara, Santa Barbara, CA, USA
fYear :
2014
fDate :
2-5 Nov. 2014
Firstpage :
3
Lastpage :
10
Abstract :
High-throughput “next generation” genome sequencing technologies are producing a flood of inexpensive genetic information that is invaluable to genomics research. Sequences of millions of genetic markers are being produced, providing genomics researchers with the opportunity to construct high-resolution genetic maps for many complicated genomes. However, the current generation of genetic mapping tools was designed for the small data setting, and is now limited by the prohibitively slow clustering algorithms employed in the genetic marker-clustering stage. In this work, we present a new approach to genetic mapping based on a fast clustering algorithm that exploits the geometry of the data. Our theoretical and empirical analysis shows that the algorithm can correctly recover linkage groups. Using synthetic and real-world data, including the grand-challenge wheat genome, we demonstrate that our approach can quickly process orders of magnitude more genetic markers than existing tools while retaining, and in some cases even improving, the quality of genetic marker clusters.
Keywords :
bioinformatics; genetics; genomics; microorganisms; genetic marker cluster quality; grand-challenge wheat genome; high-throughput next generation genome sequencing technologies; large-scale genetic mapping based fast clustering algorithm; Bioinformatics; Biological cells; Clustering algorithms; Couplings; Genetics; Sociology; Statistics
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
Conference_Location :
Belfast
Type :
conf
DOI :
10.1109/BIBM.2014.6999119
Filename :
6999119