• DocumentCode
    610390
  • Title

    Top-k graph pattern matching over large graphs

  • Author

    Jiefeng Cheng ; Xianggang Zeng ; Yu, Jeffrey Xu

  • Author_Institution
    Shenzhen Inst. of Adv. Technol., Shenzhen, China
  • fYear
    2013
  • fDate
    8-12 April 2013
  • Firstpage
    1033
  • Lastpage
    1044
  • Abstract
    There exist many graph-based applications including bioinformatics, social science, link analysis, citation analysis, and collaborative work. All need to deal with a large data graph. Given a large data graph, in this paper, we study finding top-k answers for a graph pattern query (kGPM), and in particular, we focus on top-k cyclic graph queries where a graph query is cyclic and can be complex. The capability of supporting kGPM provides much more flexibility for a user to search graphs. And the problem itself is challenging. In this paper, we propose a new framework of processing kGPM with on-the-fly ranked lists based on spanning trees of the cyclic graph query. We observe a multidimensional representation for using multiple ranked lists to answer a given kGPM query. Under this representation, we propose a cost model to estimate the least number of tree answers to be consumed in each ranked list for a given kGPM query. This leads to a query optimization approach for kGPM processing, and a top-k algorithm to process kGPM with the optimal query plan. We conducted extensive performance studies using a synthetic dataset and a real dataset, and we confirm the efficiency of our proposed approach.
  • Keywords
    pattern matching; query processing; tree searching; trees (mathematics); bioinformatics; citation analysis; collaborative work; cost model; data graph; graph pattern query; graph-based application; link analysis; multidimensional representation; multiple ranked list; optimal query plan; query optimization; search graph; social science; spanning trees; top-k answer finding; top-k cyclic graph query; top-k graph pattern matching; tree answer; Bioinformatics; Medical services; Optimization; Pattern matching; Query processing; Social network services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2013 IEEE 29th International Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4673-4909-3
  • Electronic_ISBN
    1063-6382
  • Type

    conf

  • DOI
    10.1109/ICDE.2013.6544895
  • Filename
    6544895