Regional Information Center for Science and Technology - Using Equilibrium Policy Gradients for Spatiotemporal Planning in Forest Ecosystem Management

DocumentCode :

48496

Title :

Using Equilibrium Policy Gradients for Spatiotemporal Planning in Forest Ecosystem Management

Author :

Crowley, Michael

Author_Institution :

Dept. of Electr. & Comput. Eng., Oregon State Univ., Corvallis, OR, USA

Volume :

Issue :

fYear :

2014

fDate :

Jan. 2014

Firstpage :

142

Lastpage :

154

Abstract :

Spatiotemporal planning involves making choices at multiple locations in space over some planning horizon to maximize utility and satisfy various constraints. In Forest Ecosystem Management, the problem is to choose actions for thousands of locations each year, including harvesting, treating trees for fire or pests, or doing nothing. The utility models could place value on sale of lumber, ecosystem sustainability, or employment levels, and incorporate legal and logistical constraints on actions, such as avoiding large contiguous areas of clearcutting. Simulators developed by forestry researchers provide detailed dynamics but are generally inaccessible black boxes. We model spatiotemporal planning as a factored Markov decision process and present a policy gradient planning algorithm to optimize a stochastic spatial policy using simulated dynamics. It is common in environmental and resource planning for actions at different locations to be spatially interrelated; this makes representation and planning challenging. We define a global spatial policy in terms of interacting local policies that define distributions over actions at each location, conditioned on actions at nearby locations. Markov chain Monte Carlo simulation is used to sample landscape policies and estimate their gradients. Evaluation is carried out on a forestry planning problem with 1,880 locations using a variety of value models and constraints.
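
As a rough illustration of the idea in the abstract (local policies conditioned on nearby actions, sampled by Markov chain Monte Carlo), the following minimal sketch draws one landscape-wide action assignment with a Gibbs-style sweep. Everything specific here is an assumption made for illustration and is not taken from the paper: the rectangular grid, the binary harvest/do-nothing action set, the 4-connected neighborhood, the logistic form of the local policy, and the names `local_policy_prob`, `neighbors`, and `gibbs_sample_landscape`. The abstract does not say which MCMC scheme is used, and gradient estimation is omitted.

```python
import math
import random


def local_policy_prob(theta, n_cut_neighbors):
    """Probability of harvesting a cell, given how many neighbors are harvested.

    Assumed logistic form: theta[0] is a bias, theta[1] weights the
    harvested-neighbor count (hypothetical features, not from the paper).
    """
    z = theta[0] + theta[1] * n_cut_neighbors
    return 1.0 / (1.0 + math.exp(-z))


def neighbors(i, j, rows, cols):
    """Yield the 4-connected neighbors of cell (i, j) that lie inside the grid."""
    for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        ni, nj = i + di, j + dj
        if 0 <= ni < rows and 0 <= nj < cols:
            yield ni, nj


def gibbs_sample_landscape(theta, rows, cols, sweeps, rng):
    """Sample a joint action grid (1 = harvest, 0 = do nothing).

    Each sweep resamples every cell from its local policy, conditioned on
    the current actions of its neighbors.
    """
    actions = [[0] * cols for _ in range(rows)]
    for _ in range(sweeps):
        for i in range(rows):
            for j in range(cols):
                n_cut = sum(actions[ni][nj] for ni, nj in neighbors(i, j, rows, cols))
                p = local_policy_prob(theta, n_cut)
                actions[i][j] = 1 if rng.random() < p else 0
    return actions


if __name__ == "__main__":
    # A negative neighbor weight discourages harvesting next to harvested cells,
    # loosely mimicking a constraint against large contiguous clearcuts.
    grid = gibbs_sample_landscape((-1.0, -0.5), rows=4, cols=5, sweeps=10,
                                  rng=random.Random(0))
    for row in grid:
        print(row)
```

In a fuller treatment, sampled landscapes like this would be passed to a simulator to score utility, and those scores would drive updates to `theta`; that step depends on details the abstract does not give.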

Keywords :

Markov processes; Monte Carlo methods; ecology; forestry; gradient methods; optimisation; planning; sustainable development; utility theory; Markov chain Monte Carlo simulation; ecosystem sustainability; employment levels; environmental planning; equilibrium policy gradient planning algorithm; factored Markov decision process; forest ecosystem management; forestry planning problem; generally inaccessible black boxes; global spatial policy; interacting local policies; landscape policies; legal constraints; logistical constraints; lumber; planning horizon; resource planning; simulated dynamics; spatially interrelated locations; spatiotemporal planning; stochastic spatial policy optimization; utility models; Markov decision processes; computational sustainability; ecosystem management; forestry planning; machine learning; optimization; policy gradient planning; reinforcement learning;

fLanguage :

English

Journal_Title :

Computers, IEEE Transactions on

Publisher :

IEEE

ISSN :

0018-9340

Type :

jour

DOI :

10.1109/TC.2013.113

Filename :

6514032

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=48496