Title :
Tunable and generic problem instance generation for multi-objective reinforcement learning
Author :
Garrett, Deon ; Bieger, Jordi ; Thorisson, Kristinn R.
Author_Institution :
Icelandic Inst. for Intell. Machines, Reykjavik Univ., Reykjavik, Iceland
Abstract :
A significant problem facing researchers in reinforcement learning, and particularly in multi-objective learning, is the dearth of good benchmarks. In this paper, we present a method and software tool enabling the creation of random problem instances, including multi-objective learning problems, with specific structural properties. This tool, called Merlin (for Multi-objective Environments for Reinforcement LearnINg), provides the ability to control these features in predictable ways, thus allowing researchers to begin to build a more detailed understanding about what features of a problem interact with a given learning algorithm to improve or degrade the algorithm´s performance. We present this method and tool, and briefly discuss the controls provided by the generator, its supported options, and their implications on the generated benchmark instances.
Keywords :
learning (artificial intelligence); software tools; Merlin; learning algorithm; multiobjective environments for reinforcement learning; multiobjective learning problem; multiobjective reinforcement learning; problem facing researcher; random problem instance; software tool; structural property; Benchmark testing; Correlation; Covariance matrices; Generators; Heuristic algorithms; Learning (artificial intelligence); Optimization;
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/ADPRL.2014.7010646