Commitment Without Regrets: Online Learning in Stackelberg Security Games

Maria-Florina Balcan; Avrim Blum; Nika Haghtalab; Ariel D. Procaccia

Commitment Without Regrets: Online Learning in Stackelberg Security Games

Maria-Florina Balcan ,
Avrim Blum ,
Nika Haghtalab ,
Ariel D. Procaccia

Proceedings of the 15th ACM Conference on Economics and Computation (EC), 2015. | July 2015

Download BibTex

In a Stackelberg Security Game, a defender commits to a randomized deployment of security resources, and an attacker best responds by attacking a target that maximizes his utility. While algorithms for computing an optimal strategy for the defender to commit to have had a striking real-world impact, deployed applications require signiﬁcant information about potential attackers, leading to inefﬁciencies. We address this problem via an online learning approach. We are interested in algorithms that prescribe a randomized strategy for the defender at each step against an adversarially chosen sequence of attackers, and obtain feedback on their choices (observing either the current attacker type or merely which target was attacked). We design no-regret algorithms whose regret (when compared to the best ﬁxed strategy in hindsight) is polynomial in the parameters of the game, and sublinear in the number of times steps.