• DocumentCode
    659474
  • Title

    A hypergraph-partitioned vertex programming approach for large-scale consensus optimization

  • Author

    Hui Miao ; Xiangyang Liu ; Huang, Bo ; Getoor, Lise

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Maryland, College Park, MD, USA
  • fYear
    2013
  • fDate
    6-9 Oct. 2013
  • Firstpage
    563
  • Lastpage
    568
  • Abstract
    In modern data science problems, techniques for extracting value from big data require performing large-scale optimization over heterogenous, irregularly structured data. Much of this data is best represented as multi-relational graphs, making vertex-programming abstractions such as those of Pregel and GraphLab ideal fits for modern large-scale data analysis. In this paper, we describe a vertex-programming implementation of a popular consensus optimization technique known as the alternating direction method of multipliers (ADMM) [1]. ADMM consensus optimization allows the elegant solution of complex objectives such as inference in rich probabilistic models. We also introduce a novel hypergraph partitioning technique that improves over the state-of-the-art vertex programming framework and significantly reduces the communication cost by reducing the number of replicated nodes by an order of magnitude. We implement our algorithm in GraphLab and measure scaling performance on a variety of realistic bipartite graphs and a large synthetic voter-opinion analysis application. We show a 50% improvement in running time over the current GraphLab partitioning scheme.
  • Keywords
    data analysis; data structures; graph theory; graphs; optimisation; ADMM consensus optimization; GraphLab partitioning scheme; alternating direction method of multipliers; big data; data science problems; hypergraph partitioned vertex programming framework; hypergraph partitioning; large scale consensus optimization; large scale data analysis; large scale optimization; large synthetic voter opinion analysis application; multirelational graphs; probabilistic models; realistic bipartite graphs; structured data; vertex programming abstractions; Bipartite graph; Convergence; Optimization; Partitioning algorithms; Probabilistic logic; Programming; Vectors; consensus optimization; large-scale optimization; partitioning methods; vertex programming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data, 2013 IEEE International Conference on
  • Conference_Location
    Silicon Valley, CA
  • Type

    conf

  • DOI
    10.1109/BigData.2013.6691623
  • Filename
    6691623