DocumentCode
465838
Title
Multi-class Protein Sequence Classification Using Fuzzy ARTMAP
Author
Mohamed, Shakir ; Rubin, David ; Marwala, Tshilidzi
Author_Institution
Univ. of the Witwatersrand, Johannesburg
Volume
2
fYear
2006
fDate
8-11 Oct. 2006
Firstpage
1676
Lastpage
1681
Abstract
The classification of protein sequences into families is an important tool for annotating newly discovered proteins with structural and functional properties. We present a classification system that uses pattern recognition techniques to create a numerical vector representation of a protein sequence and then classifies the sequence into one of a number of given families. We introduce the use of fuzzy ARTMAP classifiers and show that, coupled with genetic-algorithm-based feature subset selection, the system classifies protein sequences with an accuracy of 93%. Comparison with numerous other classification tools demonstrates that the fuzzy ARTMAP is suitable due to its high accuracy, quick training times, and capacity for incremental learning.
Keywords
biology computing; genetic algorithms; pattern classification; proteins; fuzzy ARTMAP classifiers; genetic algorithm; multiclass protein sequence classification; pattern recognition techniques; protein sequences classification; Amino acids; Cybernetics; Databases; Fuzzy systems; Genetic algorithms; Humans; Pattern recognition; Protein engineering; Protein sequence; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location
Taipei
Print_ISBN
1-4244-0099-6
Electronic_ISBN
1-4244-0100-3
Type
conf
DOI
10.1109/ICSMC.2006.384960
Filename
4274094