• DocumentCode
    2397655
  • Title

    Supervised Dimensionality Reduction Method for Predicting Membrane Proteins Types

  • Author

    Wang, Tong ; Xia, Tian ; Hu, Xiao-Ming

  • Author_Institution
    Inst. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
  • Volume
    2
  • fYear
    2010
  • fDate
    26-28 Aug. 2010
  • Firstpage
    112
  • Lastpage
    115
  • Abstract
    With the rapid increase of protein sequences in the post-genomic age, the need for an automated and accurate tool to predict membrane protein types becomes increasingly important. Many efforts have been tried. Most of them aim to find the optimal classification scheme and less of them take the simplifying the complexity of biological system into consideration. This work shows how to decrease the complexity of biological system with the supervised DR (Dimensionality Reduction) method by transforming the original high-dimensional feature vectors into the low-dimensional feature vectors. Moreover, a powerful sequence encoding scheme by fusing PSSM (Position-Specific Score Matrix) and PseAA (Pseudo Amino Acid) method is used to represent the protein samples. Then, the K-NN (K-Nearest Neighbor) classifier is employed to identify the membrane protein types based on their reduced low-dimensional feature vectors. As a result, the jackknife and re-substitution test success rates on this model reach 85.2% and 92.6% respectively, and suggesting that the proposed approach is very promising for predicting membrane proteins types.
  • Keywords
    biology computing; biomembranes; pattern classification; proteins; sequential codes; statistical analysis; vectors; K-NN classifier; K-nearest neighbor classifier; PSSM; PseAA; biological system; jackknife test; membrane protein types; optimal classification scheme; original high-dimensional feature vectors; position-specific score matrix; post-genomic age; protein sequences; pseudo amino acid; re-substitution test; sequence encoding scheme; supervised dimensionality reduction method; Amino acids; Biological systems; Biomembranes; Feature extraction; Proteins; Support vector machine classification; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2010 2nd International Conference on
  • Conference_Location
    Nanjing, Jiangsu
  • Print_ISBN
    978-1-4244-7869-9
  • Type

    conf

  • DOI
    10.1109/IHMSC.2010.127
  • Filename
    5590702