DocumentCode
3336187
Title
A comparative study of feature ranking methods as dimension reduction technique in Genome-Wide Association Study
Author
Ayuningtyas, C.H. ; Saptawati, G. A. Putri ; Mengko, T.L.E.R.
Author_Institution
Sch. of Electr. Eng. & Inf., Inst. Teknol. Bandung, Bandung, Indonesia
fYear
2011
fDate
17-19 July 2011
Firstpage
1
Lastpage
5
Abstract
In the recent years, Genome-Wide Association Study (GWAS) has been performed by many scientist around the world to find association between genetic profiles of different individuals with the risk of developing certain diseases. GWAS are performed using the Single Nucleotide Polymorphism (SNP) data which represents the genotypes of two different groups of individuals: the case group of individuals with the disease and the control group of individuals without the disease. The very high dimensional SNP data poses challenges in analyzing GWAS result. This issue can be tackled by performing feature ranking to remove non-relevant features for reducing the dimension of the original data. This work compares several feature ranking methods including the chi-square statistics, information gain, recursive feature elimination and Relief algorithm by analyzing the performance of different learning machines combined with the feature ranking. The highest performance is gained by combining recursive feature elimination with linear SVM while the worst performance is shown by the Relief algorithm. The experiments show that the classifiers generally benefit from the feature selection, but that the highest ranked features are not the best classifier.
Keywords
diseases; feature extraction; genetics; genomics; medical computing; GWAS; SNP; chi-square statistics; dimension reduction; diseases; feature ranking methods; genetic profiles; genome-wide association study; relief algorithm; single nucleotide polymorphism; Accuracy; Bioinformatics; Classification algorithms; Diseases; Machine learning; Support vector machines; Training; GWAS; SNP; classification; dimension reduction; feature ranking; feature selection;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical Engineering and Informatics (ICEEI), 2011 International Conference on
Conference_Location
Bandung
ISSN
2155-6822
Print_ISBN
978-1-4577-0753-7
Type
conf
DOI
10.1109/ICEEI.2011.6021621
Filename
6021621
Link To Document