DocumentCode
760411
Title
Self-Learning Disk Scheduling
Author
Zhang, Yu ; Bhargava, Bharat
Author_Institution
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN
Volume
21
Issue
1
fYear
2009
Firstpage
50
Lastpage
65
Abstract
Performance of disk I/O schedulers is affected by many factors, such as workloads, file systems, and disk systems. Disk scheduling performance can be improved by tuning scheduler parameters, such as the length of read timers. Scheduler performance tuning is mostly done manually. To automate this process, we propose four self-learning disk scheduling schemes: change-sensing Round-Robin, feedback learning, per-request learning, and two-layer learning. experiments show that the novel two-layer learning scheme performs best. It integrates the workload-level and request-level learning algorithms. It employs feedback learning techniques to analyze workloads, change scheduling policy, and tune scheduling parameters automatically. We discuss schemes to choose features for workload learning, divide and recognize workloads, generate training data, and integrate machine learning algorithms into the two-layer learning scheme. We conducted experiments to compare the accuracy, performance, and overhead of five machine learning algorithms: decision tree, logistic regression, naive Bayes, neural network, and support vector machine algorithms. Experiments with real-world and synthetic workloads show that self-learning disk scheduling can adapt to a wide variety of workloads, file systems, disk systems, and user preferences. It outperforms existing disk schedulers by as much as 15.8% while consuming less than 3%-5% of CPU time.
Keywords
belief networks; disc storage; learning (artificial intelligence); operating systems (computers); scheduling; support vector machines; change-sensing Round-Robin; decision tree; disk I/O schedulers; disk systems; feedback learning; file systems; logistic regression; machine learning algorithms; naive Bayes; neural network; operating system; per-request learning; self-learning disk scheduling; support vector machine algorithms; tuning scheduler parameters; two-layer learning; Application-transparent adaptation; Input/output; Machine learning; Sequencing and scheduling;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2008.116
Filename
4547426
Link To Document