Description
You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball

Project Showcase
Click here to view interesting agent behaviour and notice the differences between agents and their Bayesian counterparts! Click here for a introductory video. Click HERE for the slide deck!
About
In Slime Volleyball, a two-player competitive game, we investigate how modelling uncertainty improves AI players’ learning in 3 ways: 1) against an expert, 2) against itself and 3) against each other, in the domain of multi-agent reinforcement learning (MARL).
We show that by modelling uncertainty, Bayesian methods improve MARL training in 4 ways: 1) performance, 2) training stability, 3) uncertainty and 4) generalisability, and through experiments using TensorFlow Probability and Stable Baselines, we present interesting differences in agent behaviour.
We contribute code for 3 functionalities: 1) Bayesian methods using Flipout integrated into Stable Baselines, 2) Multi-agent versioned learning framework for Stable Baselines (previously with only single-agent support) and 3) Uncertainty visualisation using agent clones for Slime Volleyball Gym.
More Information
Project Website - slimerl.tech
Code Repository - GitHub
Project Video - YouTube
Slime Volleyball Gym - GitHub
TensorFlow Probability - Website
Stable Baselines - Documentation