Title :
A comparative study of feature ranking methods as dimension reduction technique in Genome-Wide Association Study
Author :
Ayuningtyas, C.H. ; Saptawati, G. A. Putri ; Mengko, T.L.E.R.
Author_Institution :
Sch. of Electr. Eng. & Inf., Inst. Teknol. Bandung, Bandung, Indonesia
Abstract :
In the recent years, Genome-Wide Association Study (GWAS) has been performed by many scientist around the world to find association between genetic profiles of different individuals with the risk of developing certain diseases. GWAS are performed using the Single Nucleotide Polymorphism (SNP) data which represents the genotypes of two different groups of individuals: the case group of individuals with the disease and the control group of individuals without the disease. The very high dimensional SNP data poses challenges in analyzing GWAS result. This issue can be tackled by performing feature ranking to remove non-relevant features for reducing the dimension of the original data. This work compares several feature ranking methods including the chi-square statistics, information gain, recursive feature elimination and Relief algorithm by analyzing the performance of different learning machines combined with the feature ranking. The highest performance is gained by combining recursive feature elimination with linear SVM while the worst performance is shown by the Relief algorithm. The experiments show that the classifiers generally benefit from the feature selection, but that the highest ranked features are not the best classifier.
Keywords :
diseases; feature extraction; genetics; genomics; medical computing; GWAS; SNP; chi-square statistics; dimension reduction; diseases; feature ranking methods; genetic profiles; genome-wide association study; relief algorithm; single nucleotide polymorphism; Accuracy; Bioinformatics; Classification algorithms; Diseases; Machine learning; Support vector machines; Training; GWAS; SNP; classification; dimension reduction; feature ranking; feature selection;
Conference_Titel :
Electrical Engineering and Informatics (ICEEI), 2011 International Conference on
Conference_Location :
Bandung
Print_ISBN :
978-1-4577-0753-7
DOI :
10.1109/ICEEI.2011.6021621