• DocumentCode
    3078554
  • Title

    Cost-Efficient High-Performance Internet-Scale Data Analytics over Multi-cloud Environments

  • Author

    Imai, Shigeru ; Patterson, Stacy ; Varela, Carlos A.

  • Author_Institution
    Dept. of Comput. Sci., Rensselaer Polytech. Inst., Troy, NY, USA
  • fYear
    2015
  • fDate
    4-7 May 2015
  • Firstpage
    793
  • Lastpage
    796
  • Abstract
    To analyze data distributed across the world, one can use distributed computing power to take advantage of data locality and achieve higher throughput. The multi-cloud model, a composition of multiple clouds, can provide cost-effective computing resources to process such distributed data. As multicolour becomes more and more accessible from cloud users, the use of MapReduce/Hadoop over multi-cloud is emerging, however, existing work has two issues in principle. First, it mainly focuses on maximizing throughput by improving data locality, but the perspective of cost optimization is missing. Second, conventional centralized optimization methods would not be able to scale well in multi-cloud environments due to its highly dynamic nature. We plan to solve the first issue by formalizing an optimization framework for MapReduce over multi-cloud including virtual machine and data transfer costs, and then the second issue by creating decentralized resource management middleware that considers multi-criteria (cost and performance) optimization. This paper reports progress we have made so far on these two directions.
  • Keywords
    cloud computing; data analysis; middleware; parallel processing; resource allocation; virtual machines; MapReduce; cost-efficient high-performance Internet-scale data analytics; data locality; data transfer costs; decentralized resource management middleware; multicloud environments; multicriteria optimization; virtual machine; Cloud computing; Data analysis; Data transfer; Distributed databases; Optimization; Resource management; Throughput; data analytics; multi-cloud; optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
  • Conference_Location
    Shenzhen
  • Type

    conf

  • DOI
    10.1109/CCGrid.2015.158
  • Filename
    7152559