• DocumentCode
    633076
  • Title

    Techniques for Graph Analytics on Big Data

  • Author

    Nisar, M. Usman ; Fard, Arash ; Miller, John A.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Georgia, Athens, GA, USA
  • fYear
    2013
  • fDate
    June 27 2013-July 2 2013
  • Firstpage
    255
  • Lastpage
    262
  • Abstract
    Graphs enjoy profound importance because of their versatility and expressivity. They can be effectively used to represent social networks, web search engines and genome sequencing. The field of graph pattern matching has been of significant importance and has wide-spread applications. Conceptually, we want to find subgraphs that match a pattern in a given graph. Much work has been done in this field with solutions like Subgraph Isomorphism and Regular Expression matching. With Big Data, scientists are frequently running into massive graphs that have amplified the challenge that this area poses. We study the speedup and communication behavior of three distributed algorithms for inexact graph pattern matching. We also study the impact of different graph partitionings on runtime and network I/O. Our extensive results show that the algorithms exhibit excellent scalable behavior and min-cut partitioning can lead to improved performance under some circumstances, and can drastically reduce the network traffic as well.
  • Keywords
    computer graphics; data analysis; distributed algorithms; pattern matching; Web search engines; big data; distributed algorithm; genome sequencing; graph analytics; graph partitioning; graph pattern matching; min-cut partitioning; network input-output; regular expression matching; runtime input-output; social networks; subgraph isomorphism; Computational modeling; Data handling; Data models; Distributed algorithms; Global Positioning System; Information management; Pattern matching; graph analytics; big data; graph simulation; parallel and distributed algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (BigData Congress), 2013 IEEE International Congress on
  • Conference_Location
    Santa Clara, CA
  • Print_ISBN
    978-0-7695-5006-0
  • Type

    conf

  • DOI
    10.1109/BigData.Congress.2013.78
  • Filename
    6597145