About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
INFORMS 2022
Poster
A leader-follower game theoretic approach for training reinforcement learning agents
Abstract
This work presents a game-theoretic formulation of multi-agent curriculum learning to improve agent learning and provide game equilibrium insights. The learning is defined by a leader - follower cooperative game. Under this setup the leader can choose among several MDPs which one is the best one given follower’s actions. Each follower then chooses how it will be solving its task using an algorithm that combines opponent modelling techniques (estimates of leader’s and other followers’ actions) and reinforcement learning. We observed that under this framework in the agents needs only a small number of epochs to converge to a desired solution, compared to the reinforcement learning agent baseline.