Title :
Efficient and scalable parallel reconstruction of sibling relationships from genetic data in wild populations
Author :
Sheikh, Saad ; Khokhar, Ashfaq ; Berger-Wolf, Tanya
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Chicago, Chicago, IL, USA
Abstract :
Wild populations of organism are often difficult to study in their natural settings. Often, it is possible to infer mating information about these species by genotyping the offspring and using the genetic information to infer sibling, and other kinship, relationships. While sibling reconstruction has been studied for a long time, none of the existing approaches have targeted scalability. In this paper, we introduce the first parallel approach to reconstructing sibling relationships from microsatellite markers. We use both functional and data domain decomposition to break down the problem and argue that this approach can be applied to other problems where columns are independent and simple constraint-based enumeration is required. We discuss algorithmic and implementation choices and their effects on results. We show that our approach is highly efficient and scalable.
Keywords :
biology computing; genetics; parallel processing; constraint-based enumeration; data domain decomposition; functional decomposition; genetic data; genetic information; microsatellite markers; parallel reconstruction; sibling reconstruction; sibling relationship; wild population;
Conference_Titel :
Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-6533-0
DOI :
10.1109/IPDPSW.2010.5470892