• DocumentCode
    735086
  • Title

    Applications of four machine learning algorithms in identifying bacterial essential genes based on composition features

  • Author

    Yan-Yan Deng ; Feng-Biao Guo

  • Author_Institution
    Health Big Data Sci. Res. Center, Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • fYear
    2015
  • fDate
    12-15 July 2015
  • Firstpage
    821
  • Lastpage
    825
  • Abstract
    Essential genes play vital roles in bacterial survival and they are potential antimicrobial targets and cornerstones of synthetic biology. Accurate recognition of bacterial essential genes by computational methods becomes necessary because of high economical and time consumption in wet experiments. In this paper, we evaluated the effectiveness of four machine learning methods that are Support Vector Machine (SVM), SVM after student´s t test (ttSVM), Principal Component Regression (PCR) and Kernel Principal Component Regression (KPCR), in identifying bacterial essential genes. A total of 24 bacterial genomes were involved and 544 compositional features, generated from the primary genome sequence in each genome. For convenience of the majority of experimental scientists to compare the effectiveness of the four methods, a web server has been constructed, which is freely available at http://cefg.uestc.edu.cn/ibeg.
  • Keywords
    biology computing; genetics; learning (artificial intelligence); principal component analysis; regression analysis; support vector machines; KPCR; SVM after student´s t test; Web server; antimicrobial targets; bacterial essential gene identification; bacterial essential gene recognition; bacterial survival; composition features; computational methods; kernel principal component regression; machine learning algorithms; primary genome sequence; support vector machine; synthetic biology; ttSVM; Decision support systems; Genetics; Indexes; Intelligent systems; Kernel; Microorganisms; Balance dataset; Compositional features; Identifying essential genes; Machine learning methods; Unbalance dataset;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
  • Conference_Location
    Chengdu
  • Type

    conf

  • DOI
    10.1109/ChinaSIP.2015.7230519
  • Filename
    7230519