• DocumentCode
    534908
  • Title

    Identifying nuclear protein subcellular localization using feature dimension reduction method

  • Author

    Wang, Tong ; Huang, Qinghua ; Hu, Lihua

  • Author_Institution
    Inst. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
  • Volume
    1
  • fYear
    2010
  • fDate
    13-14 Sept. 2010
  • Firstpage
    329
  • Lastpage
    332
  • Abstract
    The subcellular location of a protein is closely correlated with its function. Facing the deluge of protein sequences generated in the post-genomic age, it is necessary to develop useful machine learning tools to identify protein subcellular localization. Dimension reduction (DR) is one of the best-known machine learning techniques. Although some researchers have begun to explore DR methods for computer vision problems such as face recognition, few such attempts have been made for the classification of high-dimensional protein data sets. In this paper, DR methods are employed to reduce the size of the feature space. Linear DR methods (PCA and LDA) are compared with nonlinear DR methods (KPCA and KLDA) for predicting the subcellular localization of nuclear proteins. The experimental results are quite encouraging and indicate that DR methods can be used effectively to deal with the complicated problem of viral protein subcellular localization prediction. The overall jackknife success rate with KLDA is the highest among the DR methods compared.
  • Keywords
    bioinformatics; cellular biophysics; data reduction; learning (artificial intelligence); principal component analysis; proteins; KLDA; KPCA; feature dimension reduction method; feature space size reduction; linear DR method comparison; machine learning tools; nonlinear DR method; nuclear protein subcellular localization; protein function; protein sequences; viral protein subcellular localization prediction; Bioinformatics; Genomics; Proteins; Feature dimension reduction; PSSM (Position-Specific Score Matrix); Subcellular localization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Intelligence and Natural Computing Proceedings (CINC), 2010 Second International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-7705-0
  • Type

    conf

  • DOI
    10.1109/CINC.2010.5643828
  • Filename
    5643828
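
The abstract describes reducing the feature space with a DR method and then scoring predictions with a jackknife (leave-one-out) test. The following is only a minimal sketch of that general workflow, not the paper's implementation: it uses linear PCA restricted to the first principal component (found by power iteration), a 1-nearest-neighbour classifier chosen here purely for illustration, and made-up toy vectors in place of PSSM-derived protein features. All function names and data are hypothetical.

```python
import math

def mean_center(rows):
    """Return the mean-centred rows and the column means."""
    n, d = len(rows), len(rows[0])
    mu = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - mu[j] for j in range(d)] for r in rows], mu

def top_component(centered, iters=200):
    """First principal direction via power iteration on the scatter matrix."""
    d = len(centered[0])
    cov = [[sum(r[i] * r[j] for r in centered) for j in range(d)]
           for i in range(d)]
    v = [1.0] * d  # arbitrary start vector (assumption)
    for _ in range(iters):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0:
            break
        v = [x / norm for x in w]
    return v

def project(row, mu, v):
    """Project one sample onto the principal direction (1-D feature)."""
    return sum((row[j] - mu[j]) * v[j] for j in range(len(v)))

def jackknife_success_rate(X, y):
    """Leave each sample out in turn, refit PCA on the rest, classify by 1-NN."""
    correct = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        centered, mu = mean_center(train_X)
        v = top_component(centered)
        z_train = [project(r, mu, v) for r in train_X]
        z_test = project(X[i], mu, v)
        nearest = min(range(len(z_train)),
                      key=lambda k: abs(z_train[k] - z_test))
        correct += train_y[nearest] == y[i]
    return correct / len(X)

# Toy 3-D data: two well-separated classes (illustrative values only).
X = [[0.0, 0.1, 0.3], [0.2, 0.0, -0.2], [0.1, 0.3, 0.1],
     [5.0, 5.1, 0.2], [5.2, 4.9, -0.3], [4.9, 5.2, 0.0]]
y = [0, 0, 0, 1, 1, 1]

rate = jackknife_success_rate(X, y)
print(f"jackknife success rate: {rate:.2f}")
```

The projection is refit inside every jackknife fold so that the held-out sample never influences the reduced feature space. A kernel variant such as KPCA or KLDA would replace the scatter matrix with a kernel matrix, which this sketch does not attempt.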