DocumentCode
22947
Title
Learning to Reproduce Fluctuating Time Series by Inferring Their Time-Dependent Stochastic Properties: Application in Robot Learning Via Tutoring
Author
Murata, Shotaro ; Namikawa, Jun ; Arie, Hiroaki ; Sugano, S. ; Tani, Jacopo
Author_Institution
Dept. of Modern Mech. Eng., Waseda Univ., Tokyo, Japan
Volume
5
Issue
4
fYear
2013
fDate
Dec. 2013
Firstpage
298
Lastpage
310
Abstract
This study proposes a novel type of dynamic neural network model that can learn to extract stochastic or fluctuating structures hidden in time series data. The network learns to predict not only the mean of the next input state, but also its time-dependent variance. The training method is based on maximum likelihood estimation by using the gradient descent method and the likelihood function is expressed as a function of the estimated variance. Regarding the model evaluation, we present numerical experiments in which training data were generated in different ways utilizing Gaussian noise. Our analysis showed that the network can predict the time-dependent variance and the mean and it can also reproduce the target stochastic sequence data by utilizing the estimated variance. Furthermore, it was shown that a humanoid robot using the proposed network can learn to reproduce latent stochastic structures hidden in fluctuating tutoring trajectories. This learning scheme is essential for the acquisition of sensory-guided skilled behavior.
Keywords
Gaussian noise; gradient methods; humanoid robots; learning systems; maximum likelihood estimation; neurocontrollers; time series; Gaussian noise; dynamic neural network model; estimated variance function; fluctuating structures; fluctuating time series; fluctuating tutoring trajectories; gradient descent method; humanoid robot; learning scheme; likelihood function; maximum likelihood estimation; model evaluation; robot learning; sensory-guided skilled behavior; stochastic sequence data; stochastic structures; time series data; time-dependent stochastic properties; time-dependent variance; training method; Humanoids; Maximum likelihood estimation; Neural networks; Recurrent neural networks; Dynamical systems approach; humanoid robot; maximum likelihood estimation; recurrent neural network;
fLanguage
English
Journal_Title
Autonomous Mental Development, IEEE Transactions on
Publisher
ieee
ISSN
1943-0604
Type
jour
DOI
10.1109/TAMD.2013.2258019
Filename
6502665
Link To Document