DocumentCode
2568545
Title
Dimensionality effects on the Markov property in Shape Memory Alloy hysteretic environment
Author
Kirkpatrick, Kenton ; Valasek, John
Author_Institution
Aerosp. Eng. Dept., Texas A&M Univ., College Station, TX, USA
fYear
2009
fDate
11-14 Oct. 2009
Firstpage
2671
Lastpage
2676
Abstract
Shape memory alloy actuators can be used for morphing, or shape change, by controlling their temperature, which is effectively done by applying a voltage difference across their length. Control of these actuators requires determination of the relationship between voltage and strain so that an input-output map can be developed. To determine this policy and map the hysteretic region, a reinforcement learning algorithm called Sarsa was used. Proper use of reinforcement learning requires that the learning environment have the Markov property. However, hysteresis spaces are commonly referenced as non-Markovian due to the fact that state history is needed to properly predict future states and rewards. This paper reveals that this formerly non-Markovian learning environment of shape memory alloy hysteresis can become Markovian by means of increasing the dimensionality of the measured states. The paper compares learning attempts in both versions of the environment and will show that reinforcement learning is successful in the modified learning environment by learning a near-optimal policy for controlling the length of a shape memory alloy wire. This is then validated by using the modified reinforcement learning agent to learn a near-optimal control policy in an experimental setting.
Keywords
Markov processes; actuators; control engineering computing; hysteresis; learning (artificial intelligence); optimal control; shape memory effects; size control; temperature control; Markov property; Sarsa; actuators; dimensionality effects; length control; near-optimal control policy; reinforcement learning algorithm; shape memory alloy hysteresis; temperature control; Actuators; Capacitive sensors; History; Hysteresis; Learning; Shape control; Shape memory alloys; Strain control; Temperature control; Voltage control; Markov Property; Shape Memory Alloy; hysteresis; morphing; reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
Conference_Location
San Antonio, TX
ISSN
1062-922X
Print_ISBN
978-1-4244-2793-2
Electronic_ISBN
1062-922X
Type
conf
DOI
10.1109/ICSMC.2009.5346132
Filename
5346132
Link To Document