مرکز منطقه ای اطلاع رساني علوم و فناوري - ROCK∗ — Efficient black-box optimization for policy learning

DocumentCode :

117524

Title :

ROCK∗ — Efficient black-box optimization for policy learning

Author :

Jemin Hwangbo ; Gehring, Christian ; Sommer, Hannes ; Siegwart, Roland ; Buchli, Jonas

fYear :

2014

fDate :

18-20 Nov. 2014

Firstpage :

535

Lastpage :

540

Abstract :

Robotic learning on real hardware requires an efficient algorithm which minimizes the number of trials needed to learn an optimal policy. Prolonged use of hardware causes wear and tear on the system and demands more attention from an operator. To this end, we present a novel black-box optimization algorithm, Reward Optimization with Compact Kernels and fast natural gradient regression (ROCK*). Our algorithm immediately updates knowledge after a single trial and is able to extrapolate in a controlled manner. These features make fast and safe learning on real hardware possible. We have evaluated our algorithm on two simulated reaching tasks of a 50 degree-of-freedom robot arm and on a hopping task of a real articulated legged system. ROCK* outperformed current state-of-the-art algorithms in all tasks by a factor of three or more.

Keywords :

extrapolation; humanoid robots; learning systems; legged locomotion; manipulators; optimisation; regression analysis; 50 degree-of-freedom robot arm; ROCK*; articulated legged system; black-box optimization algorithm; extrapolation; hopping task; natural gradient regression; optimal policy learning; reward optimization with compact kernels; robotic learning; Kernel; Legged locomotion; Optimization; Rocks; Trajectory; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Humanoid Robots (Humanoids), 2014 14th IEEE-RAS International Conference on

Conference_Location :

Madrid

Type :

conf

DOI :

10.1109/HUMANOIDS.2014.7041414

Filename :

7041414

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=117524