• DocumentCode
    760411
  • Title

    Self-Learning Disk Scheduling

  • Author

    Zhang, Yu ; Bhargava, Bharat

  • Author_Institution
    Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN
  • Volume
    21
  • Issue
    1
  • fYear
    2009
  • Firstpage
    50
  • Lastpage
    65
  • Abstract
    Performance of disk I/O schedulers is affected by many factors, such as workloads, file systems, and disk systems. Disk scheduling performance can be improved by tuning scheduler parameters, such as the length of read timers. Scheduler performance tuning is mostly done manually. To automate this process, we propose four self-learning disk scheduling schemes: change-sensing Round-Robin, feedback learning, per-request learning, and two-layer learning. experiments show that the novel two-layer learning scheme performs best. It integrates the workload-level and request-level learning algorithms. It employs feedback learning techniques to analyze workloads, change scheduling policy, and tune scheduling parameters automatically. We discuss schemes to choose features for workload learning, divide and recognize workloads, generate training data, and integrate machine learning algorithms into the two-layer learning scheme. We conducted experiments to compare the accuracy, performance, and overhead of five machine learning algorithms: decision tree, logistic regression, naive Bayes, neural network, and support vector machine algorithms. Experiments with real-world and synthetic workloads show that self-learning disk scheduling can adapt to a wide variety of workloads, file systems, disk systems, and user preferences. It outperforms existing disk schedulers by as much as 15.8% while consuming less than 3%-5% of CPU time.
  • Keywords
    belief networks; disc storage; learning (artificial intelligence); operating systems (computers); scheduling; support vector machines; change-sensing Round-Robin; decision tree; disk I/O schedulers; disk systems; feedback learning; file systems; logistic regression; machine learning algorithms; naive Bayes; neural network; operating system; per-request learning; self-learning disk scheduling; support vector machine algorithms; tuning scheduler parameters; two-layer learning; Application-transparent adaptation; Input/output; Machine learning; Sequencing and scheduling;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2008.116
  • Filename
    4547426