• DocumentCode
    554090
  • Title

    A genetic algorithm for optimizing subnetwork markers for the study of breast cancer metastasis

  • Author

    Jiaxin Wu ; Mingxin Gan ; Rui Jiang

  • Author_Institution
    Dept. of Autom., Tsinghua Univ., Beijing, China
  • Volume
    3
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    1578
  • Lastpage
    1582
  • Abstract
    The combined use of gene expression profiles and protein-protein interaction networks has shown remarkable successes in the prediction of breast cancer metastases. Nevertheless, as a primary step of network-based methods, the problem of effectively identifying predictive subnetwork markers remains a great challenge. Typically, existing methods use greedy search algorithms to search for subnetworks. This strategy, though efficient in time complexity, may fail in finding the optimal subnetwork markers and accordingly impair the performance of the successive learning machines. In this paper, we propose a genetic algorithm to improve the subnetwork markers that have been identified by an existing greedy search method. We demonstrate that the discriminative power of the optimized subnetwork markers are significantly higher than the original subnetwork markers, and we show that higher classification performance can be achieved when using the optimized subnetworks as predictive features via six popular machine learning approaches (logistic regression, support vector machine, decision tree, Adaboost, random forest and Logitboost). According to the comparison between different classification approaches, Logitboost with the optimized subnetwork markers shows the highest classification performance and optimal reproducibility for identifying breast cancer metastases.
  • Keywords
    cancer; computational complexity; genetic algorithms; genetics; greedy algorithms; learning (artificial intelligence); pattern classification; proteins; search problems; Logitboost; breast cancer metastasis; gene expression profile; genetic algorithm; greedy search algorithm; learning machine; machine learning approach; predictive feature; protein-protein interaction network; subnetwork marker optimization; time complexity; Bioinformatics; Biological cells; Bit error rate; Breast cancer; Genetic algorithms; Metastasis; Support vector machines; gene expression profile; genetic algorithm; protein-protein interaction network; subnetwork marker;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation (ICNC), 2011 Seventh International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    2157-9555
  • Print_ISBN
    978-1-4244-9950-2
  • Type

    conf

  • DOI
    10.1109/ICNC.2011.6022270
  • Filename
    6022270