Title :
Incremental Learning for Classification of Protein Sequences
Author :
Mohamed, Shakir ; Rubin, David ; Marwala, Tshilidzi
Author_Institution :
Univ. of the Witwatersrand, Johannesburg
Abstract :
The problem of protein structural family classification remains a core problem in computational biology, with application of this technology applicable to problems in drug discovery programs and hypothetical protein annotation. Many machine learning tools have been applied to this problem using static machine learning structures such as neural networks or support vector machines that are unable to accommodate new information into their existing models. We utilize the fuzzy ARTMAP as an alternate machine learning system that has the ability of incrementally learning new data as it becomes available. The fuzzy ARTMAP is found to be comparable to many of the widespread machine learning systems. The use of an evolutionary strategy in the selection and combination of individual classifiers into an ensemble system, coupled with the incremental learning ability of the fuzzy ARTMAP is proven to be suitable as a pattern classifier. The algorithm presented is tested using data from the G-Coupled Protein Receptors Database and shows good accuracy of 83%.
Keywords :
biological techniques; biology computing; learning (artificial intelligence); proteins; computational biology; drug discovery programs; fuzzy ARTMAP; hypothetical protein annotation; incremental learning; machine learning tools; protein sequences classification; protein structural family classification; widespread machine learning systems; Computational biology; Drugs; Fuzzy systems; Learning systems; Machine learning; Neural networks; Pharmaceutical technology; Proteins; Sequences; Support vector machines;
Conference_Titel :
Neural Networks, 2007. IJCNN 2007. International Joint Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4244-1379-9
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2007.4370924