• DocumentCode
    2191580
  • Title

    Design of a Novel Protein Feature and Enzyme Function Classification

  • Author

    Lee, Bum Ju ; Lee, Heon Gyu ; Ryu, Keun Ho

  • Author_Institution
    Database/Bioinf. Lab., Chungbuk Nat. Univ., Cheongju
  • fYear
    2008
  • fDate
    8-11 July 2008
  • Firstpage
    450
  • Lastpage
    455
  • Abstract
    One of the most important researches in bioinformatics and biomedicine is to predict and classify the function of unknown protein. Recently, several studies based on alternative representation of protein have proposed for protein classification and prediction. However, most of these previous studies used only the predicted or global features extracted from protein sequence to assign function of distantly related proteins. Here, we describe a method that can assign enzyme function using features extracted from only protein sequence irrespective of sequence alignment. In our method, we design novel features presenting subtle distinction of local regions in protein sequence. In experimental results, the accuracy of the classifications for one-class versus one-class sub-problems is found in the range of 66.02% to 90.78% by support vector machine (SVM). Moreover, the results demonstrate that most of our features are valuable for enzyme function classification and add support to the facilitation of making discriminative feature set for specific enzyme function by combining traditional and novel features.
  • Keywords
    biology computing; enzymes; feature extraction; support vector machines; bioinformatics; biomedicine; enzyme function classification; feature extraction; protein classification; protein sequence; support vector machine; Enzyme; Feature extraction; Function classification; Negatively charged residues; Positively charged residues; SVM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology Workshops, 2008. CIT Workshops 2008. IEEE 8th International Conference on
  • Conference_Location
    Sydney, QLD
  • Print_ISBN
    978-0-7695-3242-4
  • Electronic_ISBN
    978-0-7695-3239-1
  • Type

    conf

  • DOI
    10.1109/CIT.2008.Workshops.59
  • Filename
    4568546