DocumentCode :
177592
Title :
Dual Fuzzy Hypergraph Regularized Multi-label Learning for Protein Subcellular Location Prediction
Author :
Jing Chen ; Yuan Yan Tang ; Chen, C.L.P. ; Yuewei Lin
Author_Institution :
Fac. of Sci. & Technol., Univ. of Macau, Macau, China
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
512
Lastpage :
516
Abstract :
With the explosion of newly found proteins, it is necessary and urgent to develop automated computational methods for protein sub cellular location prediction. In particular, the problem of predictor construction for multi-location proteins is challenging. Considering the main limitations of the existing methods, we propose a hierarchical multi-label learning model FHML for both single-location proteins and multi-location proteins. In this model, feature space is firstly decomposed onto a set of nonnegative bases under the nonnegative data factorization framework. The nonnegative bases act as latent feature concepts and the corresponding coefficients on these bases are views as the new feature representation on the latent feature concepts. The similar decomposition is later performed in label space, and then the latent label concepts are extracted. Using these latent concepts as hyper edges, we construct dual fuzzy hyper graphs to exploit the intrinsic high-order relations embedded in both feature space and label space. Finally, the sub cellular location annotation information is propagated from the labeled proteins to the unlabeled proteins by performing dual fuzzy hyper graph Laplacian regularization. In this work, our proposed method is evaluated on eukaryotic protein benchmark dataset, and the experimental results have shown its effectiveness.
Keywords :
feature extraction; fuzzy set theory; graph theory; learning (artificial intelligence); proteins; FHML; automated computational methods; dual fuzzy hypergraph Laplacian regularization; dual fuzzy hypergraph regularized multilabel learning; eukaryotic protein benchmark dataset; feature representation; feature space decomposed; hierarchical multilabel learning model; high-order relations; label space; latent label concepts; multilocation proteins; nonnegative data factorization framework; predictor construction; protein subcellular location prediction; single-location proteins; subcellular location annotation information; Feature extraction; Hidden Markov models; Laplace equations; Predictive models; Proteins; Transforms; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.97
Filename :
6976808
Link To Document :
بازگشت