• DocumentCode
    606034
  • Title

    Impact of I/O and execution scheduling strategies on large scale parallel data mining

  • Author

    Benjamas, Nunnapus ; Uthayopas, Putchong

  • Author_Institution
    Dept. of Comput. Sci., Khonkaen Univ., Khonkaen, Thailand
  • fYear
    2012
  • fDate
    23-25 Oct. 2012
  • Firstpage
    654
  • Lastpage
    660
  • Abstract
    In the era of “Big Data”, there is an emerging need to process a massive data set using large cluster system. Anyway, without the right strategies to handle the data, it is challenging to gain a good performance from the system. In this paper, many I/O and execution scheduling strategies for parallel data mining application has been investigated. The goal is to discover strategies that balance the data processing load and better utilize a multi-core cluster system for data mining application. Issues that impact the performance have been explored. The simulation results show that a substantial performance improvement can be obtained especially with a multi-core cluster system when a proper I/O and task execution sequence scheduling has been employed.
  • Keywords
    data handling; data mining; pattern clustering; I/O strategies; big data; data handling; data processing load; execution scheduling strategies; large cluster system; large scale parallel data mining; multicore cluster system; I/O scheduling; data mining; execution scheduling; multi-core; parallel FI-growth;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Service Science and Data Mining (ISSDM), 2012 6th International Conference on New Trends in
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4673-0876-2
  • Type

    conf

  • Filename
    6528714