مرکز منطقه ای اطلاع رساني علوم و فناوري - Mean Field LQG Control in Leader-Follower Stochastic Multi-Agent Systems: Likelihood Ratio Based Adaptation

DocumentCode :

1501781

Title :

Mean Field LQG Control in Leader-Follower Stochastic Multi-Agent Systems: Likelihood Ratio Based Adaptation

Author :

Nourian, Mojtaba ; Caines, Peter E. ; Malhame, Roland P. ; Minyi Huang

Author_Institution :

Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada

Volume :

Issue :

fYear :

2012

Firstpage :

2801

Lastpage :

2816

Abstract :

We study large population leader-follower stochastic multi-agent systems where the agents have linear stochastic dynamics and are coupled via their quadratic cost functions. The cost of each leader is based on a trade-off between moving toward a certain reference trajectory which is unknown to the followers and staying near their own centroid. On the other hand, followers react by tracking a convex combination of their own centroid and the centroid of the leaders. We approach this large population dynamic game problem by use of so-called Mean Field (MF) linear-quadratic-Gaussian (LQG) stochastic control theory. In this model, followers are adaptive in the sense that they use a likelihood ratio estimator (on a sample population of the leaders´ trajectories) to identify the member of a given finite class of models which is generating the reference trajectory of the leaders. Under appropriate conditions, it is shown that the true reference trajectory model is identified by each follower in finite time with probability one as the leaders´ population goes to infinity. Furthermore, we show that the resulting sets of mean field control laws for both leaders and adaptive followers possess an almost sure εN-Nash equilibrium property for a system with population N where εN goes to zero as N goes to infinity. Numerical experiments are presented illustrating the results.

Keywords :

Gaussian processes; game theory; multi-agent systems; stochastic systems; Nash equilibrium property; adaptive followers; centroid; convex combination; leader follower stochastic multiagent system; likelihood ratio based adaptation; likelihood ratio estimator; linear stochastic dynamics; mean field LQG control; mean field control laws; mean field linear quadratic Gaussian stochastic control theory; population dynamic game problem; probability; quadratic cost function; reference trajectory model; Adaptation models; Games; Lead; Mathematical model; Noise measurement; Stochastic processes; Trajectory; Adaptive control; Nash equilibria; leader-follower collective behavior; likelihood ratio based adaptation; mean field (MF) stochastic control; stochastic optimal control;

fLanguage :

English

Journal_Title :

Automatic Control, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9286

Type :

jour

DOI :

10.1109/TAC.2012.2195797

Filename :

6189042

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1501781