• DocumentCode
    2735294
  • Title

    Linear reduction methods for tag SNP selection

  • Author

    He, Jingwu ; Zelikovsky, Alex

  • Author_Institution
    Dept. of Comput. Sci., Georgia State Univ., Atlanta, GA, USA
  • Volume
    2
  • fYear
    2004
  • fDate
    1-5 Sept. 2004
  • Firstpage
    2840
  • Lastpage
    2843
  • Abstract
    It is widely hoped that constructing a complete human haplotype map will help to associate complex diseases with certain SNP´s. Unfortunately, the number of SNP´s is huge and it is very costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNP´s that should be sequenced to considerably small number of informative representatives, so called tag SNP´s. In this paper, we propose a new linear algebra based method for selecting and using tag SNP´s. Our method is purely combinatorial and can be combined with linkage disequilibrium (LD) and block based methods. We measure the quality of our tag SNP selection algorithm by comparing actual SNP´s with SNP´s linearly predicted from linearly chosen tag SNP´s. We obtain an extremely good compression and prediction rates. For example, for long haplotypes (>25000 SNP´s), knowing only 0.4% of all SNP´s we predict the entire unknown haplotype with 2% accuracy while the prediction method is based on a 10% sample of the population.
  • Keywords
    biochemistry; biology computing; combinatorial mathematics; diseases; linear algebra; molecular biophysics; organic compounds; polymorphism; block based methods; combinatorial method; human haplotype map; linear algebra based method; linear reduction methods; linkage disequilibrium methods; tag single nucleotide polymorphism selection algorithm; Accuracy; Computer science; Couplings; Diseases; Genomics; Helium; Humans; Linear algebra; Prediction methods; Technical Activities Guide -TAG; Single nucleotide polymorphism; linear independence; tag SNP;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society, 2004. IEMBS '04. 26th Annual International Conference of the IEEE
  • Conference_Location
    San Francisco, CA
  • Print_ISBN
    0-7803-8439-3
  • Type

    conf

  • DOI
    10.1109/IEMBS.2004.1403810
  • Filename
    1403810