• DocumentCode
    2989626
  • Title

    Integrated analysis of gene expression and genome-wide DNA methylation for tumor prediction: An association rule mining-based approach

  • Author

    Mallik, S. ; Mukhopadhyay, Amit ; Maulik, Ujjwal ; Bandyopadhyay, Supriyo

  • Author_Institution
    Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
  • fYear
    2013
  • fDate
    16-19 April 2013
  • Firstpage
    120
  • Lastpage
    127
  • Abstract
    Statistical analysis and association rule mining are two most efficient techniques, where the first one is used to identify differentially expressed/methylated genes across different types of samples or experimental conditions and the second one is used to determine expression/methylation relationships among them. In this article, we have performed an integrated analysis of statistical methods and association rule mining on mRNA expression and DNA methylation datasets for the prediction of Uterine Leiomyoma. Moreover, we have proposed a novel rule-base classifier. Depending on 16 different rule-interestingness measures, we have applied a Genetic Algorithm based rank aggregation technique on the association rules which are generated from the training data by Apriori association rule mining algorithm. After determining the ranks of the rules, we have conducted a majority voting technique on each test point to determine its class-label (i.e. tumor or normal class-label) through weighted-sum method. We have run this classifier on the combined dataset using k-fold cross-validation and also performed a comparative performance analysis with other popular rule-base classifiers. Finally, we have predicted the status of some important genes (through frequency analysis in association rules for tumor and normal class-labels individually) that have a major role for tumor formation in Uterine Leiomyoma.
  • Keywords
    DNA; biology computing; data mining; diseases; genetic algorithms; genetics; pattern classification; tumours; DNA methylation datasets; apriori association rule mining algorithm; association rule mining-based approach; association rules; combined dataset; comparative performance analysis; expressed genes; expression relationship; frequency analysis; gene expression; genetic algorithm; genome-wide DNA methylation; integrated analysis; k-fold cross-validation; mRNA expression; majority voting technique; methylated genes; methylation relationship; normal class-labels; rank aggregation technique; rule-base classifiers; rule-interestingness measures; statistical analysis; statistical methods; tumor formation; tumor prediction; uterine leiomyoma; weighted-sum method; Bioinformatics; Computational biology; Computational intelligence; Decision support systems; Handheld computers; Sensitivity; Tumors; DNA methylation; Gene expression; Genetic Algorithm (GA) based rank aggregation; association rule mining; rule-interestingness measures; statistical tests;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2013 IEEE Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/CIBCB.2013.6595397
  • Filename
    6595397