Multi-Agent Reinforcement Learning for Language-Based Social Deduction
DOI:
https://doi.org/10.71465/fair719Keywords:
Social deduction, natural language communication, multi-agent RLAbstract
This study applies MARL to train LLM agents for language-based social deduction, where communication directly influences multi-agent outcomes. Speaking policies are optimized using rewards that capture the causal impact of messages on other agents’ beliefs, while listening models predict hidden state information from dialogue. Training on 12,000 simulated game episodes results in a 2.1× increase in win rate over standard RL baselines and demonstrates emergent strategic behaviors such as coordinated accusations and evidence sharing.
Downloads
Downloads
Published
Issue
Section
License
Copyright (c) 2026 David R. Miller, Sophie L. Gagnon, James P. Wilson (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.