• DocumentCode
    3573987
  • Title

    Genetic algorithm for an optimized weighted voting scheme incorporating k-separated bigram transition probabilities to improve protein fold recognition

  • Author

    Saini, Harsh ; Raicar, Gaurav ; Lal, Sunil ; Dehzangi, Abdollah ; Lyons, James ; Paliwal, Kuldip K. ; Imoto, Seiya ; Miyano, Satoru ; Sharma, Alok

  • Author_Institution
    Univ. of the South Pacific, Suva, Fiji
  • fYear
    2014
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    In biology, identifying the tertiary structure of a protein helps determine its functions. A step towards tertiary structure identification is predicting a protein´s fold. Computational methods have been applied to determine a protein´s fold by assembling information from its structural, physicochemical and/or evolutionary properties. It has been shown that evolutionary data helps improve prediction accuracy. In this study, a scheme is proposed that uses the genetic algorithm (GA) to optimize a weighted voting system to improve protein fold recognition. This scheme incorporates k-separated bigram transition probabilities for feature extraction, which are based on the Position Specific Scoring Matrix (PSSM). A set of SVM classifiers are used for initial classification, whereupon their predictions are consolidated using the optimized weighted voting system. This scheme has been demonstrated on the Ding and Dubchak (DD) benchmarked data set.
  • Keywords
    biology computing; feature extraction; genetic algorithms; matrix algebra; probability; proteins; support vector machines; SVM classifiers; biology; evolutionary property; feature extraction; genetic algorithm; k-separated bigram transition probability; optimized weighted voting scheme; physicochemical property; position specific scoring matrix; protein fold recognition; tertiary structure identification; Accuracy; Amino acids; Feature extraction; Genetic algorithms; Proteins; Support vector machines; Training; Genetic Algorithm; PSSM; Protein fold recognition; SCOP; Support Vector Machines; k-separated bigrams;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Engineering (APWC on CSE), 2014 Asia-Pacific World Congress on
  • Print_ISBN
    978-1-4799-1955-0
  • Type

    conf

  • DOI
    10.1109/APWCCSE.2014.7053846
  • Filename
    7053846