arxiv arXiv cs.LG · 6d ago · research

LLM-based Hierarchical Control in Multi-Agent Games

from English

A hierarchical system using a pretrained LLM to select RL skill policies outperforms flat RL in a 2v2 King of the Hill environment. It matches hand-crafted behavior tree performance in win rate and is perceived as more human-like by 60% of users, highlighting effective coordination and adaptability without manual rule design.

Importance 3/3 New feature vs. leaders New harness with differentiators arXiv cs.LG OpenAI Google DeepMind Meta AI AI agents Reasoning models Training methods

Read original