شماره ركورد كنفرانس :
3860
عنوان مقاله :
Flappy Bird with Deep Reinforcement Learning
پديدآورندگان :
Parhizkar Amirreza Amir.parhiz@aut.ac.ir Amirkabir University of Technology, Tehran , Amani Mojtaba Amanimojtaba329@gmail.com University of Bojnord
كليدواژه :
machine learning , neural network , reinforcement , convolution
عنوان كنفرانس :
دومين كنفرانس ملي محاسبات نرم
چكيده فارسي :
In this paper, a convolutional neural network model recently developed by Minh et al 2015 is applied to evaluate the Qfunction from raw pixel values from the screen. We took advantage of this method for Flappy Bird, a mobile game which is well known for being hard for humans to play. This method is capable of approximating Q-function to allow generalization to unseen states, not only it leads to faster convergence, but also it makes proposed method to be generalize enough to apply for different problems in different domain without changing much.