• DocumentCode
    3749261
  • Title

    Prediction of protein cellular localization site by using data mining techniques

  • Author

    Bhanu Priya;Amit Chhabra

  • Author_Institution
    Dept. of Computer Engineering & Technology, Guru Nanak Dev University, Amritsar, Punjab, India
  • fYear
    2015
  • Firstpage
    731
  • Lastpage
    736
  • Abstract
    In recent years data mining has been intensively used in the medical field, bioinformatics and biomedical research. This paper focus on the prediction of the protein localization site on the basis of their amino acid sequences in Escherichia Coli (E coli) bacteria. This is of great importance because information on cellular location is helpful for annotation of proteins and genes. So there is need to develop a simple method with high prediction accuracy. To accomplish this various classification techniques are considered on the dataset by performing various experiments. Based on these experiments various measures such as classification accuracy, error rate, F-Measure, etc are calculated. The dataset used is the E coli dataset. The maximum accuracy is achieved by the proposed hybrid model of Support Vector Machine and the LogitBoost technique. The classification accuracy achieved by this model is 95.23%.
  • Keywords
    "Proteins","Data mining","Classification algorithms","Support vector machines","Protein engineering","Biomembranes","Heuristic algorithms"
  • Publisher
    ieee
  • Conference_Titel
    Computing and Network Communications (CoCoNet), 2015 International Conference on
  • Type

    conf

  • DOI
    10.1109/CoCoNet.2015.7411271
  • Filename
    7411271