DocumentCode :
2989626
Title :
Integrated analysis of gene expression and genome-wide DNA methylation for tumor prediction: An association rule mining-based approach
Author :
Mallik, S. ; Mukhopadhyay, Amit ; Maulik, Ujjwal ; Bandyopadhyay, Supriyo
Author_Institution :
Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
fYear :
2013
fDate :
16-19 April 2013
Firstpage :
120
Lastpage :
127
Abstract :
Statistical analysis and association rule mining are two most efficient techniques, where the first one is used to identify differentially expressed/methylated genes across different types of samples or experimental conditions and the second one is used to determine expression/methylation relationships among them. In this article, we have performed an integrated analysis of statistical methods and association rule mining on mRNA expression and DNA methylation datasets for the prediction of Uterine Leiomyoma. Moreover, we have proposed a novel rule-base classifier. Depending on 16 different rule-interestingness measures, we have applied a Genetic Algorithm based rank aggregation technique on the association rules which are generated from the training data by Apriori association rule mining algorithm. After determining the ranks of the rules, we have conducted a majority voting technique on each test point to determine its class-label (i.e. tumor or normal class-label) through weighted-sum method. We have run this classifier on the combined dataset using k-fold cross-validation and also performed a comparative performance analysis with other popular rule-base classifiers. Finally, we have predicted the status of some important genes (through frequency analysis in association rules for tumor and normal class-labels individually) that have a major role for tumor formation in Uterine Leiomyoma.
Keywords :
DNA; biology computing; data mining; diseases; genetic algorithms; genetics; pattern classification; tumours; DNA methylation datasets; apriori association rule mining algorithm; association rule mining-based approach; association rules; combined dataset; comparative performance analysis; expressed genes; expression relationship; frequency analysis; gene expression; genetic algorithm; genome-wide DNA methylation; integrated analysis; k-fold cross-validation; mRNA expression; majority voting technique; methylated genes; methylation relationship; normal class-labels; rank aggregation technique; rule-base classifiers; rule-interestingness measures; statistical analysis; statistical methods; tumor formation; tumor prediction; uterine leiomyoma; weighted-sum method; Bioinformatics; Computational biology; Computational intelligence; Decision support systems; Handheld computers; Sensitivity; Tumors; DNA methylation; Gene expression; Genetic Algorithm (GA) based rank aggregation; association rule mining; rule-interestingness measures; statistical tests;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2013 IEEE Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/CIBCB.2013.6595397
Filename :
6595397
Link To Document :
بازگشت