Title :
Improving learning and adaptation in security games by exploiting information asymmetry
Author :
Xiaofan He ; Huaiyu Dai ; Peng Ning
Author_Institution :
Dept. of ECE, North Carolina State Univ., Raleigh, NC, USA
fDate :
April 26 2015-May 1 2015
Abstract :
With the advancement of modern technologies, the security battle between a legitimate system (LS) and an adversary is becoming increasingly sophisticated, involving complex interactions in unknown dynamic environments. Stochastic game (SG), together with multi-agent reinforcement learning (MARL), offers a systematic framework for the study of information warfare in current and emerging cyber-physical systems. In practical security games, each player usually has only incomplete information about the opponent, which induces information asymmetry. This work exploits information asymmetry from a new angle, considering how to exploit local information unknown to the opponent to the player´s advantage. Two new MARL algorithms, termed minimax-PDS and WoLF-PDS, are proposed, which enable the LS to learn and adapt faster in dynamic environments by exploiting its private local information. The proposed algorithms are provably convergent and rational, respectively. Also, numerical results are presented to show their effectiveness through two concrete anti-jamming examples.
Keywords :
learning (artificial intelligence); multi-agent systems; security of data; stochastic games; LS; MARL; SG; WoLF-PDS; adaptation; concrete anti-jamming; cyber-physical systems; information asymmetry; information warfare; legitimate system; minimax-PDS; multiagent reinforcement learning; security games; stochastic game; unknown dynamic environments; Computers; Conferences; Games; Heuristic algorithms; Jamming; Security; Sensors;
Conference_Titel :
Computer Communications (INFOCOM), 2015 IEEE Conference on
Conference_Location :
Kowloon
DOI :
10.1109/INFOCOM.2015.7218560