DocumentCode :
48496
Title :
Using Equilibrium Policy Gradients for Spatiotemporal Planning in Forest Ecosystem Management
Author :
Crowley, Michael
Author_Institution :
Dept. of Electr. & Comput. Eng., Oregon State Univ., Corvallis, OR, USA
Volume :
63
Issue :
1
fYear :
2014
fDate :
Jan. 2014
Firstpage :
142
Lastpage :
154
Abstract :
Spatiotemporal planning involves making choices at multiple locations in space over some planning horizon to maximize utility and satisfy various constraints. In Forest Ecosystem Management, the problem is to choose actions for thousands of locations each year including harvesting, treating trees for fire or pests, or doing nothing. The utility models could place value on sale of lumber, ecosystem sustainability or employment levels and incorporate legal and logistical constraints on actions such as avoiding large contiguous areas of clearcutting. Simulators developed by forestry researchers provide detailed dynamics but are generally inaccesible black boxes. We model spatiotemporal planning as a factored Markov decision process and present a policy gradient planning algorithm to optimize a stochastic spatial policy using simulated dynamics. It is common in environmental and resource planning to have actions at different locations be spatially interelated; this makes representation and planning challenging. We define a global spatial policy in terms of interacting local policies defining distributions over actions at each location conditioned on actions at nearby locations. Markov chain Monte Carlo simulation is used to sample landscape policies and estimate their gradients. Evaluation is carried out on a forestry planning problem with 1,880 locations using a variety of value models and constraints.
Keywords :
Markov processes; Monte Carlo methods; ecology; forestry; gradient methods; optimisation; planning; sustainable development; utility theory; Markov chain Monte Carlo simulation; ecosystem sustainability; employment levels; environmental planning; equilibrium policy gradient planning algorithm; factored Markov decision process; forest ecosystem management; forestry planning problem; generally inaccesible black boxes; global spatial policy; interacting local policies; landscape policies; legal constraints; logistical constraints; lumber; planning horizon; resource planning; simulated dynamics; spatially interelated locations; spatiotemporal planning; stochastic spatial policy optimization; utility models; Markov decision processes; computational sustainability; ecosystem management; forestry planning; machine learning; optimization; policy gradient planning; reinforcement learning;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/TC.2013.113
Filename :
6514032
Link To Document :
بازگشت