DocumentCode
177592
Title
Dual Fuzzy Hypergraph Regularized Multi-label Learning for Protein Subcellular Location Prediction
Author
Jing Chen ; Yuan Yan Tang ; Chen, C.L.P. ; Yuewei Lin
Author_Institution
Fac. of Sci. & Technol., Univ. of Macau, Macau, China
fYear
2014
fDate
24-28 Aug. 2014
Firstpage
512
Lastpage
516
Abstract
With the explosion of newly found proteins, it is necessary and urgent to develop automated computational methods for protein sub cellular location prediction. In particular, the problem of predictor construction for multi-location proteins is challenging. Considering the main limitations of the existing methods, we propose a hierarchical multi-label learning model FHML for both single-location proteins and multi-location proteins. In this model, feature space is firstly decomposed onto a set of nonnegative bases under the nonnegative data factorization framework. The nonnegative bases act as latent feature concepts and the corresponding coefficients on these bases are views as the new feature representation on the latent feature concepts. The similar decomposition is later performed in label space, and then the latent label concepts are extracted. Using these latent concepts as hyper edges, we construct dual fuzzy hyper graphs to exploit the intrinsic high-order relations embedded in both feature space and label space. Finally, the sub cellular location annotation information is propagated from the labeled proteins to the unlabeled proteins by performing dual fuzzy hyper graph Laplacian regularization. In this work, our proposed method is evaluated on eukaryotic protein benchmark dataset, and the experimental results have shown its effectiveness.
Keywords
feature extraction; fuzzy set theory; graph theory; learning (artificial intelligence); proteins; FHML; automated computational methods; dual fuzzy hypergraph Laplacian regularization; dual fuzzy hypergraph regularized multilabel learning; eukaryotic protein benchmark dataset; feature representation; feature space decomposed; hierarchical multilabel learning model; high-order relations; label space; latent label concepts; multilocation proteins; nonnegative data factorization framework; predictor construction; protein subcellular location prediction; single-location proteins; subcellular location annotation information; Feature extraction; Hidden Markov models; Laplace equations; Predictive models; Proteins; Transforms; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location
Stockholm
ISSN
1051-4651
Type
conf
DOI
10.1109/ICPR.2014.97
Filename
6976808
Link To Document