• DocumentCode
    3064031
  • Title

    A Cloud Framework for Parameter Sweeping Data Mining Applications

  • Author

    Marozzo, Fabrizio ; Talia, Domenico ; Trunfio, Paolo

  • Author_Institution
    DEIS, Univ. of Calabria, Rende, Italy
  • fYear
    2011
  • fDate
    Nov. 29 2011-Dec. 1 2011
  • Firstpage
    367
  • Lastpage
    374
  • Abstract
    Data mining techniques are used in many application areas to extract useful knowledge from large datasets. Very often, parameter sweeping is used in data mining applications to explore the effects produced on the data analysis result by different values of the algorithm parameters. Parameter sweeping applications can be highly computing demanding, since the number of single tasks to be executed increases with the number of swept parameters and the range of their values. Cloud technologies can be effectively exploited to provide end-users with the computing and storage resources, and the execution mechanisms needed to efficiently run this class of applications. In this paper, we present a Data Mining Cloud App framework that supports the execution of parameter sweeping data mining applications on a Cloud. The framework has been implemented using the Windows Azure platform, and evaluated through a set of parameter sweeping clustering and classification applications. The experimental results demonstrate the effectiveness of the proposed framework, as well as the scalability that can be achieved through the parallel execution of parameter sweeping applications on a pool of virtual servers.
  • Keywords
    cloud computing; data analysis; data mining; knowledge acquisition; pattern classification; pattern clustering; algorithm parameter; cloud computing; cloud technology; data mining cloud application framework; data sets; knowledge extraction; parallel execution; parameter sweeping classification; parameter sweeping clustering; parameter sweeping data mining; storage resource; swept parameter sweeping application; virtual server; windows Azure platform; Cloud computing; Clustering algorithms; Data mining; Monitoring; Servers; User interfaces; Cloud computing; Data mining; Parameter sweeping;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4673-0090-2
  • Type

    conf

  • DOI
    10.1109/CloudCom.2011.56
  • Filename
    6133165