• DocumentCode
    1352722
  • Title

    Mutation Region Detection for Closely Related Individuals without a Known Pedigree

  • Author

    Wenji Ma ; Yong Yang ; Zhi-Zhong Chen ; Lusheng Wang

  • Author_Institution
    Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China
  • Volume
    9
  • Issue
    2
  • fYear
    2012
  • Firstpage
    499
  • Lastpage
    510
  • Abstract
    Linkage analysis serves as a way of finding locations of genes that cause genetic diseases. Linkage studies have facilitated the identification of several hundreds of human genes that can harbor mutations which by themselves lead to a disease phenotype. The fundamental problem in linkage analysis is to identify regions whose allele is shared by all or almost all affected members but by none or few unaffected members. Almost all the existing methods for linkage analysis are for families with clearly given pedigrees. Little work has been done for the case where the sampled individuals are closely related, but their pedigree is not known. This situation occurs very often when the individuals share a common ancestor at least six generations ago. Solving this case will tremendously extend the use of linkage analysis for finding genes that cause genetic diseases. In this paper, we propose a mathematical model (the shared center problem) for inferring the allele-sharing status of a given set of individuals using a database of confirmed haplotypes as reference. We show the NP-completeness of the shared center problem and present a ratio-2 polynomial-time approximation algorithm for its minimization version (called the closest shared center problem). We then convert the approximation algorithm into a heuristic algorithm for the shared center problem. Based on this heuristic, we finally design a heuristic algorithm for mutation region detection. We further implement the algorithms to obtain a software package. Our experimental data show that the software is both fast and accurate. The package is available at >;http://www.cs.cityu.edu.hk/~lwang/software/LDWP/ for noncommercial use.
  • Keywords
    computational complexity; diseases; evolution (biological); genetics; heuristic programming; medical computing; molecular biophysics; optimisation; NP-completeness; allele-sharing status; closest shared center problem; disease phenotype; genes; genetic diseases; haplotypes; heuristic algorithm; linkage analysis; mutation region detection; pedigree; ratio-2 polynomial-time approximation algorithm; shared center problem; Algorithm design and analysis; Approximation algorithms; Biological cells; Couplings; Heuristic algorithms; Inference algorithms; Software algorithms; Haplotype inference; allele-sharing status; and approximation algorithm.; linkage analysis; pedigree; Algorithms; Computational Biology; Databases, Genetic; Female; Genetic Linkage; Haplotypes; Humans; Male; Models, Genetic; Mutation; Pedigree; Polymorphism, Single Nucleotide; Software;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2011.134
  • Filename
    6051420