Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning


Proceedings 2019

We investigate how reinforcement learning agents can learn to cooperate. Drawing inspiration from human societies, in which successful coordination of many individuals is often facilitated by hierarchical organisation, we introduce Feudal Multi-agent Hierarchies (FMH). In this framework, a ‘manager’ agent, which is tasked with maximising the environmentally-determined reward function, learns to communicate subgoals to multiple, simultaneously-operating, ‘worker’ agents. Workers, which are rewarded for achieving managerial subgoals, take concurrent actions in the world. We outline the structure of FMH and demonstrate its potential for decentralised learning and control. We find that, given an adequate set of subgoals from which to choose, FMH performs, and particularly scales, substantially better than cooperative approaches that use a shared reward function.

Author(s):	Ahilan, S and Dayan, P
Book Title:	4th Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2019)
Pages:	57
Year:	2019

Bibtex Type:	Proceedings (proceedings)

Electronic Archiving:	grant_archive

BibTeX

@proceedings{item_3152813,
  title = {{Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning}},
  booktitle = {{4th Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2019)}},
  abstract = {{We investigate how reinforcement learning agents can learn to cooperate. Drawing inspiration from human societies, in which successful coordination of many individuals is often facilitated by hierarchical organisation, we introduce Feudal Multi-agent Hierarchies (FMH). In this framework, a \textquoteleft{}manager\textquoteright{} agent, which is tasked with maximising the environmentally-determined reward function, learns to communicate subgoals to multiple, simultaneously-operating, \textquoteleft{}worker\textquoteright{} agents. Workers, which are rewarded for achieving managerial subgoals, take concurrent actions in the world. We outline the structure of FMH and demonstrate its potential for decentralised learning and control. We find that, given an adequate set of subgoals from which to choose, FMH performs, and particularly scales, substantially better than cooperative approaches that use a shared reward function.}},
  pages = {57},
  year = {2019},
  slug = {item_3152813},
  author = {Ahilan, S and Dayan, P}
}
