Title :
Recognition of microRNA-binding sites in proteins from sequences using Laplacian Support Vector Machines with a hybrid feature
Author :
Jiansheng Wu ; Wei Han ; Dong Hu ; Xin Xu ; Shancheng Yan ; Lihua Tang
Author_Institution :
Sch. of Geogr. & Biol. Inf., Nanjing Univ. of Posts & Telecommun., Nanjing, China
Abstract :
The recognition of microRNA (miRNA)-binding residues in proteins would further enhance our understanding of how miRNAs silence their target genes and some relevant biological processes. Due to the insufficient labeled examples, traditional methods such as SVMs could not work well on such problems. Thus, we propose a semi-supervised learning method, i.e., Laplacian Support Vector Machine (LapSVM) for recognizing miRNA-binding residues in proteins from sequences by making use of both labeled and unlabeled data in this article. A hybrid feature is put forward for coding instances which incorporates evolutionary information of the amino acid sequence and mutual interaction propensities in protein-miRNA complex structures. The results indicate that the LapSVM model receives good performance with a F1 score of 22.06±0.28% and an AUC (area under the ROC curve) value of 0.760±0.043. A web server called MBindR is built and freely available at http:// cbi.njupt.edu.cn/MBindR/MBindR.htm for academic usage.
Keywords :
Internet; RNA; biochemistry; bioinformatics; bonds (chemical); evolution (biological); genetics; learning (artificial intelligence); molecular biophysics; molecular configurations; proteins; sequences; support vector machines; AUC value; F1 score; LapSVM hybrid feature; LapSVM model performance; Laplacian support vector machines; MBindR web server; amino acid sequence; area under the ROC curve value; data labelling; evolutionary information; gene silencing; miRNA-binding residue recognition; microRNA-binding site recognition; mutual interaction propensities; protein sequences; protein-miRNA complex structures; semisupervised learning method; Amino acids; Prediction algorithms; Predictive models; Proteins; Reliability; Support vector machines; Training; Laplacian Support Vector Machine; evolutionary information; miRNA-binding residues; mutual interaction propensities;
Conference_Titel :
Biomedical Engineering and Informatics (BMEI), 2013 6th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-2760-9
DOI :
10.1109/BMEI.2013.6746990