• DocumentCode
    685911
  • Title

    A Method of Data Distribution for Distributed Cross Join

  • Author

    Ping Lu ; Shengmei Luo ; Zhiping Wang ; Wenwu Qu

  • fYear
    2013
  • fDate
    13-15 Dec. 2013
  • Firstpage
    105
  • Lastpage
    109
  • Abstract
    One of the major challenges in big data processing is the efficiency of cross join, such as the similarity calculation in business intelligence. In this paper we introduce an optimal data distribution algorithm for distributed cross join which combine each row from the first table with each row from the second table, which can reduce the network traffic and guarantee the computation balance of the distributed system.
  • Keywords
    Big Data; distributed algorithms; Big Data processing; business intelligence; computation balance; distributed cross-join efficiency; distributed system; network traffic reduction; optimal data distribution algorithm; similarity calculation; Algorithm design and analysis; Clustering algorithms; Computational modeling; Distributed databases; Niobium; Optimization; cross join; data distribution;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Cloud and Big Data (CBD), 2013 International Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4799-3260-3
  • Type

    conf

  • DOI
    10.1109/CBD.2013.5
  • Filename
    6824581