• DocumentCode
    465838
  • Title

    Multi-class Protein Sequence Classification Using Fuzzy ARTMAP

  • Author

    Mohamed, Shakir ; Rubin, David ; Marwala, Tshilidzi

  • Author_Institution
    Univ. of the Witwatersrand, Johannesburg
  • Volume
    2
  • fYear
    2006
  • fDate
    8-11 Oct. 2006
  • Firstpage
    1676
  • Lastpage
    1681
  • Abstract
    The classification of protein sequences into families is an important tool in the annotation of structural and functional properties to newly discovered proteins. We present a classification system using pattern recognition techniques to create a numerical vector representation of a protein sequence and then classify the sequence into a number of given families. We introduce the use of fuzzy ARTMAP classifiers and show that coupled with the genetic algorithm based feature subset selection, the system is able to classify protein sequences with an accuracy of 93%. This accuracy is compared with numerous other classification tools and demonstrates that the fuzzy ARTMAP is suitable due to its high accuracy, quick training times and ability for incremental learning.
  • Keywords
    biology computing; genetic algorithms; pattern classification; proteins; fuzzy ARTMAP classifiers; genetic algorithm; multiclass protein sequence classification; pattern recognition techniques; protein sequences classification; Amino acids; Cybernetics; Databases; Fuzzy systems; Genetic algorithms; Humans; Pattern recognition; Protein engineering; Protein sequence; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    1-4244-0099-6
  • Electronic_ISBN
    1-4244-0100-3
  • Type

    conf

  • DOI
    10.1109/ICSMC.2006.384960
  • Filename
    4274094