• DocumentCode
    2694311
  • Title

    Controller design for quadrotor UAVs using reinforcement learning

  • Author

    Bou-Ammar, Haitham ; Voos, Holger ; Ertel, Wolfgang

  • Author_Institution
    Mobile Robot. Lab., Univ. of Appl. Sci. Ravensburg-Weingarten, Weingarten, Germany
  • fYear
    2010
  • fDate
    8-10 Sept. 2010
  • Firstpage
    2130
  • Lastpage
    2135
  • Abstract
    Quadrotor UAVs are one of the most preferred type of small unmanned aerial vehicles because of the very simple mechanical construction and propulsion principle. However, the nonlinear dynamic behavior requires a rather advanced stabilizing control of these vehicles. One possible approach that relaxes the difficult task of nonlinear control design is the application of a learning algorithm that allows the training of suitable control actions. Here we apply reinforcement learning as one form of unsupervised learning. In this paper, we first propose a nonlinear autopilot for quadrotor UAVs based on feedback linearization. This controller is then compared to an autopilot which has been learned by reinforcement learning using fitted value iteration with regard to design effort and performance. First simulation and experimental results underline the outcome of this comparison.
  • Keywords
    aerospace robotics; aircraft control; feedback; learning systems; mobile robots; nonlinear control systems; remotely operated vehicles; stability; unsupervised learning; feedback linearization; fitted value iteration; nonlinear autopilot; nonlinear control design; nonlinear dynamic behavior; quadrotor UAV; reinforcement learning; small unmanned aerial vehicles; stabilizing control; unsupervised learning; Control systems; Input variables; Learning; Mathematical model; Rotors; Unmanned aerial vehicles;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Applications (CCA), 2010 IEEE International Conference on
  • Conference_Location
    Yokohama
  • Print_ISBN
    978-1-4244-5362-7
  • Electronic_ISBN
    978-1-4244-5363-4
  • Type

    conf

  • DOI
    10.1109/CCA.2010.5611206
  • Filename
    5611206