• DocumentCode
    3256347
  • Title

    DRS: Dynamic Resource Scheduling for Real-Time Analytics over Fast Streams

  • Author

    Fu, Tom Z. J. ; Jianbing Ding ; Ma, Richard T. B. ; Winslett, Marianne ; Yin Yang ; Zhenjie Zhang

  • Author_Institution
    Adv. Digital Sci. Center, Illinois at Singapore Pte. Ltd., Singapore, Singapore
  • fYear
    2015
  • fDate
    June 29 2015-July 2 2015
  • Firstpage
    411
  • Lastpage
    420
  • Abstract
    In a data stream management system (DSMS), users register continuous queries, and receive result updates as data arrive and expire. We focus on applications with real-time constraints, in which the user must receive each result update within a given period after the update occurs. To handle fast data, the DSMS is commonly placed on top of a cloud infrastructure. Because stream properties such as arrival rates can fluctuate unpredictably, cloud resources must be dynamically provisioned and scheduled accordingly to ensure real-time response. It is essential, for the existing systems or future developments, to possess the ability of scheduling resources dynamically according to the current workload, in order to avoid wasting resources, or failing in delivering correct results on time. Motivated by this, we propose DRS, a novel dynamic resource scheduler for cloud-based DSMSs. DRS overcomes three fundamental challenges: (a) how to model the relationship between the provisioned resources and query response time (b) where to best place resources, and (c) how to measure system load with minimal overhead. In particular, DRS includes an accurate performance model based on the theory of Jackson open queueing networks and is capable of handling arbitrary operator topologies, possibly with loops, splits and joins. Extensive experiments with real data confirm that DRS achieves real-time response with close to optimal resource consumption.
  • Keywords
    cloud computing; data handling; dynamic scheduling; resource allocation; DRS; DSMS; arbitrary operator topologies; cloud infrastructure; cloud resources; data stream management system; dynamic resource scheduling; fast streams; open queueing networks; real-time analytics; scheduling resources; Computational modeling; Delays; Dynamic scheduling; Feature extraction; Processor scheduling; Program processors; Real-time systems; data stream analytics; resource scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Distributed Computing Systems (ICDCS), 2015 IEEE 35th International Conference on
  • Conference_Location
    Columbus, OH
  • ISSN
    1063-6927
  • Type

    conf

  • DOI
    10.1109/ICDCS.2015.49
  • Filename
    7164927