DocumentCode :
1548510
Title :
Top-k Similar Graph Matching Using TraM in Biological Networks
Author :
Amin, Md Syedul ; Finley, R.L. ; Jamil, Hasan M.
Author_Institution :
Dept. of Comput. Sci., Wayne State Univ., Sunnyvale, CA, USA
Volume :
9
Issue :
6
fYear :
2012
Firstpage :
1790
Lastpage :
1804
Abstract :
Many emerging database applications entail sophisticated graph-based query manipulation, predominantly evident in large-scale scientific applications. To access the information embedded in graphs, efficient graph matching tools and algorithms have become of prime importance. Although the prohibitively expensive time complexity associated with exact subgraph isomorphism techniques has limited its efficacy in the application domain, approximate yet efficient graph matching techniques have received much attention due to their pragmatic applicability. Since public domain databases are noisy and incomplete in nature, inexact graph matching techniques have proven to be more promising in terms of inferring knowledge from numerous structural data repositories. In this paper, we propose a novel technique called TraM for approximate graph matching that off-loads a significant amount of its processing on to the database making the approach viable for large graphs. Moreover, the vector space embedding of the graphs and efficient filtration of the search space enables computation of approximate graph similarity at a throw-away cost. We annotate nodes of the query graphs by means of their global topological properties and compare them with neighborhood biased segments of the data-graph for proper matches. We have conducted experiments on several real data sets, and have demonstrated the effectiveness and efficiency of the proposed method.
Keywords :
bioinformatics; genetics; graphs; isomorphism; tree searching; TraM; bioinformatics; biological networks; database applications; exact subgraph isomorphism techniques; expensive time complexity; genetics; global topological properties; large-scale scientific; neighborhood biased segments; noisy public domain databases; pragmatic applicability; query graphs; sophisticated graph-based query manipulation; structural data repository; top-k similar graph matching; vector space embedding; Bioinformatics; Computational biology; Databases; Diseases; Genetics; Graphics; Proteins; Graphs and networks; bioinformatics; biology and genetics; graph and tree search strategies; knowledge and data engineering tools and techniques; Algorithms; Animals; Computational Biology; Data Mining; Databases, Factual; Drosophila; Humans; Protein Interaction Maps; Reproducibility of Results;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2012.90
Filename :
6226354
Link To Document :
بازگشت