DocumentCode :
1760916
Title :
Software Suite for Gene and Protein Annotation Prediction and Similarity Search
Author :
Chicco, Davide ; Masseroli, Marco
Author_Institution :
Dipt. di Elettron., Informazione e Bioingegneria, Politec. di Milano, Milan, Italy
Volume :
12
Issue :
4
fYear :
2015
fDate :
July-Aug. 1 2015
Firstpage :
837
Lastpage :
843
Abstract :
In the computational biology community, machine learning algorithms are key instruments for many applications, including the prediction of gene functions based upon the available biomolecular annotations. They may also be employed to compute similarity between genes or proteins. Here, we describe and discuss a software suite we developed to implement and make publicly available some of these prediction methods, together with a computational technique based upon Latent Semantic Indexing (LSI), which leverages both inferred and available annotations to search for semantically similar genes. The suite consists of three components. BioAnnotationPredictor is a computational software module that predicts new gene functions based upon Singular Value Decomposition of available annotations. SimilBio is a Web module that leverages annotations, either available or predicted by BioAnnotationPredictor, to discover similarities between genes via LSI. The suite also includes SemSim, a new Web service built upon these modules to allow accessing them programmatically. We integrated SemSim in the Bio Search Computing framework (http://www.bioinformatics.deib.polimi.it/bio-seco/seco/), where users can exploit the Search Computing technology to run multi-topic complex queries on multiple integrated Web services. Accordingly, researchers may obtain ranked answers involving the computation of the functional similarity between genes in support of biomedical knowledge discovery.
Keywords :
Web services; bioinformatics; genetics; programming language semantics; proteins; singular value decomposition; Bio Search Computing framework; BioAnnotationPredictor; SemSim; SimilBio; Web module; biomedical knowledge discovery; biomolecular annotations; computational biology community; computational software module; gene annotation prediction; gene function prediction; latent semantic indexing; machine learning algorithms; multiple integrated Web services; multitopic complex queries; protein annotation prediction; semantically similar genes; similarity search; software suite; Computational biology; Large scale integration; Ontologies; Semantics; Gene Ontology; Search Computing; Web service; gene similarity search; semantic similarity
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
IEEE
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2014.2382127
Filename :
6987347