• DocumentCode
    3481148
  • Title

    Prediction of protein subcellular localization based on primary sequence data

  • Author

    Özarar, Mert ; Atalay, Volkan ; Atalay, Rengül Cetin

  • Author_Institution
    ODTU Bilgisayar Muhendisligi Bolumu, Turkey
  • fYear
    2004
  • fDate
    28-30 April 2004
  • Firstpage
    118
  • Lastpage
    120
  • Abstract
    Subcellular localization is crucial for determining the functions of proteins. A system called prediction of protein subcellular localization (P2SL) that predicts the subcellular localization of proteins in eukaryotic organisms based on the amino acid content of primary sequences using amino acid order is designed. The approach for prediction is to find the most frequent motifs for each protein in a given class based on clustering via self organizing maps and then to use these most frequent motifs as features for classification by the help of multilayer perceptrons. This approach allows a classification independent of the length of the sequence. In addition to these, the use of a new encoding scheme is described for the amino acids that conserves biological function based on the point of accepted mutations (PAM) substitution matrix. The statistical test results of the system are presented on a four class problem. P2SL achieves slightly higher prediction accuracy than the similar studies.
  • Keywords
    medical signal processing; multilayer perceptrons; pattern classification; pattern clustering; proteins; self-organising feature maps; sequences; P2SL; PAM substitution matrix; amino acid content; amino acid order; biological function; classification; clustering; encoding scheme; eukaryotic organisms; four class problem; multilayer perceptrons; point of accepted mutations; prediction of protein subcellular localization; self organizing maps; statistical test; Amino acids; Biological information theory; Encoding; Genetic mutations; Matrices; Multilayer perceptrons; Organisms; Proteins; Self organizing feature maps; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference, 2004. Proceedings of the IEEE 12th
  • Print_ISBN
    0-7803-8318-4
  • Type

    conf

  • DOI
    10.1109/SIU.2004.1338272
  • Filename
    1338272