• DocumentCode
    611000
  • Title

    Exploiting per user information for supercomputing workload prediction requires care

  • Author

    Dinh, Thanh Vinh ; Andrew, Lachlan L. H. ; Branch, P.

  • Author_Institution
    Swinburne Univ. of Technol., Hawthorn, VIC, Australia
  • fYear
    2013
  • fDate
    13-16 May 2013
  • Firstpage
    2
  • Lastpage
    9
  • Abstract
    Efficient management of supercomputing facilities requires estimates of future workload based on past user behaviour. For supercomputers with large numbers of users, aggregate user behaviour is commonly assumed to be best in prediction of future workloads, however for systems with smaller numbers of users the question arises as to whether it is still suitable or if benefits can be derived from monitoring individual user behaviour to predict future workload. We compare using individual user behaviour, aggregate user behaviour and a hybrid approach where we track heavy users individually and cluster aggregate light users into a small number of clusters. We find that the hybrid approach produces the best results in both mean absolute error and mean squared error. However, treating all users separately provides slightly worse predictions. We also introduce a new approach to prediction based on the hazard function which is a significant improvement on previously used schemes based on autoregressive models. The schemes are investigated numerically using a two-year workload trace from a supercomputer with a population of 136 users.
  • Keywords
    autoregressive processes; behavioural sciences; mean square error methods; parallel machines; social aspects of automation; aggregate user behaviour; autoregressive model; future workload prediction; hazard function; hybrid approach; mean absolute error; mean squared error; supercomputer; supercomputing facility management; supercomputing workload prediction; two-year workload trace; user behaviour monitoring; Adaptation models; Aggregates; Autoregressive processes; Computational modeling; Hazards; Predictive models; Supercomputers; hazard rate function; supercomputing; user behaviours; workload prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2013 13th IEEE/ACM International Symposium on
  • Conference_Location
    Delft
  • Print_ISBN
    978-1-4673-6465-2
  • Type

    conf

  • DOI
    10.1109/CCGrid.2013.68
  • Filename
    6546052