H. Sheikh, M. Razghandi, and L. Bölöni

Learning Distributed Cooperative Policies For Security Games via Deep Reinforcement Learning


Cite as:

H. Sheikh, M. Razghandi, and L. Bölöni. Learning Distributed Cooperative Policies For Security Games via Deep Reinforcement Learning. In Proc. of Conference on Computers, Software and Applications (COMPSAC-2019), pp. 489–494, July 2019.

Download:

(unavailable)

Abstract:

A rich amount of literature is available for solving the problem of finding equilibrium strategies in two-player security games that harness the power of integer linear programming (ILP). However, in practice, most security games are accurately modeled with multiple agents where ILP methods either fail to find the optimal solution or the state space is large enough making ILP methods an impractical solution. In this paper, we consider a multi-agent security game setting and propose MultiOptGrad: a novel deep reinforcement learning-based solution to learn distributed optimal policies for defenders. Additionally, using MultiOptGrad we built an agent agnostic reinforcement learning framework for robotic bodyguards that recommend deployment strategies for them in a coordinate system. To demonstrate the effectiveness of our proposed solution, we consider an urban security game where a team of robotic bodyguards are protecting a VIP from physical assault in the presence of neutral and/or adversarial bystanders. Our empirical analysis has shown that policies learnt via MultiOptGrad outperformed quadrant load-balancing (QLB): a hand-engineered technique for solving the VIP protection problem.

BibTeX:

@inproceedings{Sheikh-2019-COMPSAC,
title = "Learning Distributed Cooperative Policies For Security Games via Deep Reinforcement Learning",
author = "H. Sheikh and M. Razghandi and L. B{\"o}l{\"o}ni",
booktitle = "Proc. of Conference on Computers, Software and Applications (COMPSAC-2019)",
year = "2019",
month = "July",
doi={10.1109/COMPSAC.2019.00075},
pages = "489-494",
abstract = {
  A rich amount of literature is available for solving the problem of finding equilibrium strategies in two-player security games that harness the power of integer linear programming (ILP). However, in practice, most security games are accurately modeled with multiple agents where ILP methods either fail to find the optimal solution or the state space is large enough making ILP methods an impractical solution. In this paper, we consider a multi-agent security game setting and propose MultiOptGrad: a novel deep reinforcement learning-based solution to learn distributed optimal policies for defenders. Additionally, using MultiOptGrad we built an agent agnostic reinforcement learning framework for robotic bodyguards that recommend deployment strategies for them in a coordinate system. To demonstrate the effectiveness of our proposed solution, we consider an urban security game where a team of robotic bodyguards are protecting a VIP from physical assault in the presence of neutral and/or adversarial bystanders. Our empirical analysis has shown that policies learnt via MultiOptGrad outperformed quadrant load-balancing (QLB): a hand-engineered technique for solving the VIP protection problem.
 },
}

Generated by bib2html.pl (written by Patrick Riley, Lotzi Boloni ) on Fri Jan 29, 2021 20:15:22