DocumentCode
3453898
Title
Feature selections using AdaBoost: Application in gene-gene interaction detection
Author
Assareh, A. ; Volkert, L.G. ; Jing Li
Author_Institution
CS Dept., Kent State Univ., Kent, OH, USA
fYear
2012
fDate
4-7 Oct. 2012
Firstpage
831
Lastpage
837
Abstract
One of the main goals of genome wide association studies (GWAS) has been detecting the gene-gene interactions, also known as epistasis in a broad sense, underlying complex diseases. The ability of decision trees and their ensembles to capture interactions among input variable has attracted attention among computational biologists for this aim. However, individual decision trees suffer from some limitations including data fragmentation and representational problem that can impact the epistasis detection performance of their ensembles when not taken into account. Here we take a closer look at feature selection capability of AdaBoost in the realm of epistasis detection and the effect of tuning the weak classifiers on its performance. We also explore the efficacy of applying different statistical and information theoretic strategies in tandem with AdaBoost in order to improve its performance. The results show that the performance of AdaBoost is more sensitive to the parameters settings of the weak learner when risk allele frequencies are low, which can be explained with respect to the data fragmentation phenomenon. Also depending on the model of interaction between the risk SNPs different criterion might excel in the second stage.
Keywords
biology computing; data structures; decision trees; genomics; learning (artificial intelligence); pattern classification; AdaBoost; GWAS; computational biologist; data fragmentation problem; data representational problem; decision trees; ensemble learning; epistasis; feature selection; gene-gene interaction detection; genome wide association studies; weak classifier tuning; Additives; Bioinformatics; Data models; Decision trees; Diseases; Logistics; Mutual information; AdaBoost; GWAS; decision trees; epistasis;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
978-1-4673-2746-6
Electronic_ISBN
978-1-4673-2744-2
Type
conf
DOI
10.1109/BIBMW.2012.6470248
Filename
6470248
Link To Document