• DocumentCode
    177592
  • Title

    Dual Fuzzy Hypergraph Regularized Multi-label Learning for Protein Subcellular Location Prediction

  • Author

    Jing Chen ; Yuan Yan Tang ; Chen, C.L.P. ; Yuewei Lin

  • Author_Institution
    Fac. of Sci. & Technol., Univ. of Macau, Macau, China
  • fYear
    2014
  • fDate
    24-28 Aug. 2014
  • Firstpage
    512
  • Lastpage
    516
  • Abstract
    With the explosion of newly found proteins, it is necessary and urgent to develop automated computational methods for protein sub cellular location prediction. In particular, the problem of predictor construction for multi-location proteins is challenging. Considering the main limitations of the existing methods, we propose a hierarchical multi-label learning model FHML for both single-location proteins and multi-location proteins. In this model, feature space is firstly decomposed onto a set of nonnegative bases under the nonnegative data factorization framework. The nonnegative bases act as latent feature concepts and the corresponding coefficients on these bases are views as the new feature representation on the latent feature concepts. The similar decomposition is later performed in label space, and then the latent label concepts are extracted. Using these latent concepts as hyper edges, we construct dual fuzzy hyper graphs to exploit the intrinsic high-order relations embedded in both feature space and label space. Finally, the sub cellular location annotation information is propagated from the labeled proteins to the unlabeled proteins by performing dual fuzzy hyper graph Laplacian regularization. In this work, our proposed method is evaluated on eukaryotic protein benchmark dataset, and the experimental results have shown its effectiveness.
  • Keywords
    feature extraction; fuzzy set theory; graph theory; learning (artificial intelligence); proteins; FHML; automated computational methods; dual fuzzy hypergraph Laplacian regularization; dual fuzzy hypergraph regularized multilabel learning; eukaryotic protein benchmark dataset; feature representation; feature space decomposed; hierarchical multilabel learning model; high-order relations; label space; latent label concepts; multilocation proteins; nonnegative data factorization framework; predictor construction; protein subcellular location prediction; single-location proteins; subcellular location annotation information; Feature extraction; Hidden Markov models; Laplace equations; Predictive models; Proteins; Transforms; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2014 22nd International Conference on
  • Conference_Location
    Stockholm
  • ISSN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2014.97
  • Filename
    6976808