DocumentCode :
2950683
Title :
Learning in Glaucoma Genetic Risk Assessment
Author :
Zhang, Zhuo ; Liu, Jiang ; Kwoh, Chee Keong ; Sim, Xueling ; Tay, Wan Ting ; Tan, Yonghua ; Yin, Fengshou ; Wong, Tien Yin
Author_Institution :
Inst. for Infocomm Res., A*STAR, Singapore, Singapore
fYear :
2010
fDate :
Aug. 31 2010-Sept. 4 2010
Firstpage :
6182
Lastpage :
6185
Abstract :
Genome Wide Association (GWA) studies are powerful tools to identify genes involved in common human diseases, and are becoming increasingly important in genetic epidemiology research. However, the statistical approaches behind GWA studies lack capability in taking into account the possible interactions among genetic markers; and true disease variants may be lost in statistical noise due to high threshold. A typical GWA study reports a few highly suspected signals, e.g. Single-nucleotide polymorphisms (SNPs), which usually account for a tiny portion of overall genetic risks for the disease of interest. This study proposes a computational learning approach in addition to parametric statistical methods along with a filtering mechanism, to build glaucoma genetic risk assessment model. Our data set was obtained from Singapore Malay Eye Study (SiMES), genotyped on Illumina 610quad arrays. We constructed case-control data set with 233 glaucoma and 458 healthy samples. A standard case-control association test was conducted on post-QC dataset with more than 500k SNPs. Genetic profile is constructed using genotype information from a list of 412 SNPs filtered by a relaxed p-value threshold of 1×10-3, and forms the feature space for learning. Among the five learning algorithms we performed, Support Vector Machines with radial kernel (SVM-radial) achieved the best result, with area under curve (ROC) of 99.4% and accuracy of 95.9%. The result illustrates that, learning approach in post GWAS data analysis is able to accurately assess genetic risk for glaucoma. The approach is more robust and comprehensive than individual SNPs matching method. We will further validate our results in several other data sets obtained in consequential population studies conducted in Singapore.
Keywords :
genetics; genomics; learning (artificial intelligence); medical computing; statistical analysis; support vector machines; vision defects; Illumina 610quad arrays; SiMES; Singapore Malay Eye Study; computational learning approach; consequential population studies; filtering mechanism; genes; genetic epidemiology research; genetic markers; genetic profile; genome wide association studies; genotyping; glaucoma genetic risk assessment; human diseases; parametric statistical methods; radial kernel; single-nucleotide polymorphisms; statistical noise; support vector machines; Accuracy; Bioinformatics; Diseases; Genomics; Machine learning; Risk management; Adult; Aged; Aged, 80 and over; Algorithms; Databases, Genetic; Genetic Predisposition to Disease; Genome-Wide Association Study; Glaucoma; Humans; Learning; Middle Aged; Polymorphism, Single Nucleotide; ROC Curve; Reproducibility of Results; Risk Assessment;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society (EMBC), 2010 Annual International Conference of the IEEE
Conference_Location :
Buenos Aires
ISSN :
1557-170X
Print_ISBN :
978-1-4244-4123-5
Type :
conf
DOI :
10.1109/IEMBS.2010.5627757
Filename :
5627757
Link To Document :
بازگشت