• DocumentCode
    3459990
  • Title

    A Comparative Study of Ensemble Learning Approaches in the Classification of Breast Cancer Metastasis

  • Author

    Zhang, Wangshu ; Zeng, Feng ; Wu, Xuebing ; Zhang, Xuegong ; Jiang, Rui

  • Author_Institution
    Dept. of Autom., Tsinghua Univ., Beijing, China
  • fYear
    2009
  • fDate
    3-5 Aug. 2009
  • Firstpage
    242
  • Lastpage
    245
  • Abstract
    The combined use of gene expression profiles and protein-protein interaction (PPI) networks has recently shed light on breast cancer research by selecting a small number of subnetworks as disease markers and then using them for the classification of metastasis. Based on previously identified subnetwork markers, we compare three ensemble learning approaches (AdaBoost, LogitBoost and random forest) with two widely used classifiers (logistic regression and support vector machine) in the classification of breast cancer metastasis. In leave-one-out cross-validation experiments on two breast cancer data sets, the ensemble learning methods can lead logis-tic regression and support vector machine by 22.4% and 4.8% respectively in terms of the classification accuracy. In cross data set validation experiments, the ensemble learning methods also demonstrate superior reproducibility over the other two methods. With these results, we infer that the ensemble learn-ing approaches with subnetwork markers might be more suit-able in handling the classification problem of breast cancer metastasis, and we recommend the use of these approaches in similar classification problems.
  • Keywords
    biology computing; cancer; genetics; learning (artificial intelligence); medical computing; proteins; support vector machines; AdaBoost; LogitBoost; breast cancer metastasis classification; ensemble learning approaches; gene expression profiles; logistic regression; protein-protein interaction networks; random forest; subnetwork markers; support vector machine; Breast cancer; Diseases; Gene expression; Learning systems; Logistics; Machine learning; Metastasis; Proteins; Support vector machine classification; Support vector machines; breast cancer metastasis; classification; ensemble learning; subnetwork markers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3739-9
  • Type

    conf

  • DOI
    10.1109/IJCBS.2009.23
  • Filename
    5260680