DocumentCode :
3749261
Title :
Prediction of protein cellular localization site by using data mining techniques
Author :
Bhanu Priya;Amit Chhabra
Author_Institution :
Dept. of Computer Engineering & Technology, Guru Nanak Dev University, Amritsar, Punjab, India
fYear :
2015
Firstpage :
731
Lastpage :
736
Abstract :
In recent years data mining has been intensively used in the medical field, bioinformatics and biomedical research. This paper focus on the prediction of the protein localization site on the basis of their amino acid sequences in Escherichia Coli (E coli) bacteria. This is of great importance because information on cellular location is helpful for annotation of proteins and genes. So there is need to develop a simple method with high prediction accuracy. To accomplish this various classification techniques are considered on the dataset by performing various experiments. Based on these experiments various measures such as classification accuracy, error rate, F-Measure, etc are calculated. The dataset used is the E coli dataset. The maximum accuracy is achieved by the proposed hybrid model of Support Vector Machine and the LogitBoost technique. The classification accuracy achieved by this model is 95.23%.
Keywords :
"Proteins","Data mining","Classification algorithms","Support vector machines","Protein engineering","Biomembranes","Heuristic algorithms"
Publisher :
ieee
Conference_Titel :
Computing and Network Communications (CoCoNet), 2015 International Conference on
Type :
conf
DOI :
10.1109/CoCoNet.2015.7411271
Filename :
7411271
Link To Document :
بازگشت