• DocumentCode
    3259416
  • Title

    Clustering of SNP Data with Application to Genomics

  • Author

    Ng, Michael K. ; Li, Mark J. ; Ao, Sio I. ; Sham, Pak C. ; Cheung, Yiu-Ming ; Huang, Joshua Z.

  • Author_Institution
    Dept. of Math., Hong Kong Baptist Univ.
  • fYear
    2006
  • fDate
    Dec. 2006
  • Firstpage
    158
  • Lastpage
    162
  • Abstract
    Single nucleotide polymorphisms (SNPs) are very common throughout the genome and hence are potentially valuable for mapping disease susceptibility loci by detecting association between SNP markers and disease. Many methods may only be applicable when marker haplotypes, rather than genotypes (categorical data), are available for analysis. In this paper, we explore the properties of k-modes (categorical data) clustering algorithms to SNP data for detecting association between SNP markers and disease. Sub-space k-modes clustering properties are also considered and tested
  • Keywords
    DNA; biology computing; diseases; genetics; pattern clustering; categorical data; clustering algorithms; genomics; genotypes; haplotypes; single nucleotide polymorphisms clustering; subspace k-modes clustering; Application software; Bioinformatics; Chromosome mapping; Clustering algorithms; Couplings; Data mining; Diseases; Genomics; Mathematics; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    0-7695-2702-7
  • Type

    conf

  • DOI
    10.1109/ICDMW.2006.43
  • Filename
    4063617