Title :
Mining human mitochondrial and mitochondrial associated proteins based on SVM and neural network
Author :
Li, Guangrong ; Huang, Zhong ; Xia, Jiali ; Lu, Caimei
Author_Institution :
Sch. of Comput., Wuhan Univ., Wuhan
Abstract :
Human mitochondrial proteins are involved in fundamental biological process including apoptosis, energy production and many metabolic pathways, prediction of mitochondrial proteins is a major challenge in genome annotation. In this study, we implemented a machine learning approach and developed reliable neural network and SVM based methods to classify human mitochondria proteins with high confidence. We used experimentally characterized human mitochondria proteins as positive training datasets and human proteins localized in other organelles as negative training datasets for neural network, support vector machine, naive bayes and bayes network classification. In addition, we constructed simple amino acid composition model, a hybrid model of simple amino acid composition combining amino acid chemical-physical properties, and dipeptide amino acid composition model. With 5 fold cross-validations, the results demonstrate that multiple perceptrone neural network performs better than SVM in all three training models. We concluded that our classification approach utilizing empirically characterized human mitochondria protein sequences is a valuable tool for classifying human mitochondria proteins.
Keywords :
Bayes methods; biology computing; data mining; genetics; learning (artificial intelligence); neural nets; proteins; support vector machines; Bayes network classification; SVM; apoptosis; chemical-physical property; dipeptide amino acid composition model; energy production; fundamental biological process; genome annotation; human mitochondria protein sequences; human mitochondria proteins classification; human mitochondrial mining; human mitochondrial proteins; machine learning approach; metabolic pathways; mitochondrial associated proteins; mitochondrial protein prediction; multiple perceptrone neural network; naive Bayes; negative training datasets; organelles; support vector machine; Amino acids; Bioinformatics; Biological processes; Genomics; Humans; Neural networks; Production; Proteins; Support vector machine classification; Support vector machines;
Conference_Titel :
Granular Computing, 2008. GrC 2008. IEEE International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-2512-9
Electronic_ISBN :
978-1-4244-2513-6
DOI :
10.1109/GRC.2008.4664722