DocumentCode
3259416
Title
Clustering of SNP Data with Application to Genomics
Author
Ng, Michael K. ; Li, Mark J. ; Ao, Sio I. ; Sham, Pak C. ; Cheung, Yiu-Ming ; Huang, Joshua Z.
Author_Institution
Dept. of Math., Hong Kong Baptist Univ.
fYear
2006
fDate
Dec. 2006
Firstpage
158
Lastpage
162
Abstract
Single nucleotide polymorphisms (SNPs) are very common throughout the genome and hence are potentially valuable for mapping disease susceptibility loci by detecting association between SNP markers and disease. Many methods may only be applicable when marker haplotypes, rather than genotypes (categorical data), are available for analysis. In this paper, we explore the properties of k-modes (categorical data) clustering algorithms to SNP data for detecting association between SNP markers and disease. Sub-space k-modes clustering properties are also considered and tested
Keywords
DNA; biology computing; diseases; genetics; pattern clustering; categorical data; clustering algorithms; genomics; genotypes; haplotypes; single nucleotide polymorphisms clustering; subspace k-modes clustering; Application software; Bioinformatics; Chromosome mapping; Clustering algorithms; Couplings; Data mining; Diseases; Genomics; Mathematics; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location
Hong Kong
Print_ISBN
0-7695-2702-7
Type
conf
DOI
10.1109/ICDMW.2006.43
Filename
4063617
Link To Document