• DocumentCode
    3103342
  • Title

    DryadLINQ for Scientific Analyses

  • Author

    Ekanayake, Jaliya ; Gunarathne, Thilina ; Fox, Geoffrey ; Balkir, Atilla Soner ; Poulain, Christophe ; Araujo, Nelson ; Barga, Roger

  • Author_Institution
    Sch. of Inf. & Comput., Indiana Univ., Bloomington, IN, USA
  • fYear
    2009
  • fDate
    9-11 Dec. 2009
  • Firstpage
    329
  • Lastpage
    336
  • Abstract
    Applying high level parallel runtimes to data/compute intensive applications is becoming increasingly common. The simplicity of the MapReduce programming model and the availability of open source MapReduce runtimes such as Hadoop, are attracting more users to the MapReduce programming model. Microsoft has released DryadLINQ for academic use, allowing users to experience a new programming model and a runtime that is capable of performing large scale data/compute intensive analyses. In this paper, we present our experience in applying DryadLINQ for a series of scientific data analysis applications, identify their mapping to the DryadLINQ programming model, and compare their performances with Hadoop implementations of the same applications.
  • Keywords
    data analysis; natural sciences computing; parallel programming; DryadLINQ; Hadoop; MapReduce programming model; Microsoft; academic use; high level parallel runtimes; open source MapReduce runtimes; scientific analyses; scientific data analysis applications; Astronomy; Availability; Conference management; Fault tolerant systems; Hardware; Monitoring; Physics computing; Pipelines; Resilience; USA Councils; Cloud Computing; DryadLINQ; Hadoop; MapReduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Science, 2009. e-Science '09. Fifth IEEE International Conference on
  • Conference_Location
    Oxford
  • Print_ISBN
    978-0-7695-3877-8
  • Type

    conf

  • DOI
    10.1109/e-Science.2009.53
  • Filename
    5380850