• DocumentCode
    2568545
  • Title

    Dimensionality effects on the Markov property in Shape Memory Alloy hysteretic environment

  • Author

    Kirkpatrick, Kenton ; Valasek, John

  • Author_Institution
    Aerosp. Eng. Dept., Texas A&M Univ., College Station, TX, USA
  • fYear
    2009
  • fDate
    11-14 Oct. 2009
  • Firstpage
    2671
  • Lastpage
    2676
  • Abstract
    Shape memory alloy actuators can be used for morphing, or shape change, by controlling their temperature, which is effectively done by applying a voltage difference across their length. Control of these actuators requires determination of the relationship between voltage and strain so that an input-output map can be developed. To determine this policy and map the hysteretic region, a reinforcement learning algorithm called Sarsa was used. Proper use of reinforcement learning requires that the learning environment have the Markov property. However, hysteresis spaces are commonly referenced as non-Markovian due to the fact that state history is needed to properly predict future states and rewards. This paper reveals that this formerly non-Markovian learning environment of shape memory alloy hysteresis can become Markovian by means of increasing the dimensionality of the measured states. The paper compares learning attempts in both versions of the environment and will show that reinforcement learning is successful in the modified learning environment by learning a near-optimal policy for controlling the length of a shape memory alloy wire. This is then validated by using the modified reinforcement learning agent to learn a near-optimal control policy in an experimental setting.
  • Keywords
    Markov processes; actuators; control engineering computing; hysteresis; learning (artificial intelligence); optimal control; shape memory effects; size control; temperature control; Markov property; Sarsa; actuators; dimensionality effects; length control; near-optimal control policy; reinforcement learning algorithm; shape memory alloy hysteresis; temperature control; Actuators; Capacitive sensors; History; Hysteresis; Learning; Shape control; Shape memory alloys; Strain control; Temperature control; Voltage control; Markov Property; Shape Memory Alloy; hysteresis; morphing; reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
  • Conference_Location
    San Antonio, TX
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-2793-2
  • Electronic_ISBN
    1062-922X
  • Type

    conf

  • DOI
    10.1109/ICSMC.2009.5346132
  • Filename
    5346132