• DocumentCode
    3739922
  • Title

    Efficient Top-k Skyline Computation in MapReduce

  • Author

    Baoyan Song;Aili Liu;Linlin Ding

  • Author_Institution
    Sch. of Inf., Liaoning Univ., Shenyang, China
  • fYear
    2015
  • Firstpage
    67
  • Lastpage
    70
  • Abstract
    Skyline is widely used in multi-objective decisionmaking, data visualization and other fields. With the rapid increasing of data volume, skyline of big data has also attracted more and more attention. However, skyline of big data has its own shortcomings. When the dimension increases, skyline results will be numerous, and we would like to select k points from the result sets. In this paper, we propose the top-k skyline of big data. It is a Distributed Top-k Skyline Method in MapReduce, called MR-DTKS. Firstly, we convert the multidimensional data to a single value to determine the dominance relationship of two data points. Secondly, we calculate the score by using the converted values to filter out most of unwanted data objects. Finally, we choose k data objects having the strongest dominating capacity. A large number of experiments show that our method is effective, and has good flexibility and scalability on real data sets as well as synthetic data sets.
  • Keywords
    "Big data","Computational efficiency","Data visualization","Algorithm design and analysis","Distributed databases","Indexes"
  • Publisher
    ieee
  • Conference_Titel
    Web Information System and Application Conference (WISA), 2015 12th
  • Print_ISBN
    978-1-4673-9371-3
  • Type

    conf

  • DOI
    10.1109/WISA.2015.57
  • Filename
    7396609