DocumentCode :
3673175
Title :
REFBSS: Reference based similarity search in biological network databases
Author :
Arda Soylev;Osman Abul
Author_Institution :
Department of Computer Engineering, Necmettin Erbakan University, Konya, Turkey
fYear :
2015
Firstpage :
1
Lastpage :
8
Abstract :
Biological networks, mostly abstracted as graphs, are key to many important activities inside the cell. Similarity-based analysis is one technique for understanding the role of a query network. In that context, a database of biological networks is aligned with the query network, and the networks are separated into those whose similarity score is above a predefined cutoff value and those whose score is below it. Because computing a nontrivial similarity score involves the NP-complete subgraph isomorphism problem, it is computationally too expensive. To address this, several methods offering acceptable solutions have been proposed in the literature. Reference-based indexing is one popular approach: it indexes the network database by extracting small networks as references to be aligned with the query network. Based on this strategy, we propose a novel model with methodological and heuristic improvements for fast approximate similarity search, all of which turn out to be fast and accurate. We also present a high-performance Hadoop implementation that achieved a speedup of 11.42 on an 18-core Hadoop cluster with a sample KEGG network database.
Keywords :
"Indexing","Upper bound","Biological system modeling","Approximation algorithms","Accuracy"
Publisher :
IEEE
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2015 IEEE Conference on
Type :
conf
DOI :
10.1109/CIBCB.2015.7300279
Filename :
7300279