Code accompanying the paper "When Is Diversity Rewarded in Cooperative Multi-Agent Learning?" presented at ICLR 2026.
@inproceedings{bettini2026hetenvdesign,
title={When Is Diversity Rewarded in Cooperative Multi-Agent Learning?},
author={Amir, Michael and Bettini, Matteo and Prorok, Amanda},
year={2026},
booktitle={International Conference on Learning Representations (ICLR)},
}
This supplementary material contains our code and the configuration files needed to replicate all of our experiments and figures exactly as they appear in the paper.
- Create a virtual environment with Python 3.11 (e.g., with conda):
conda create --name env python=3.11
- Install the dependencies:
pip install torch==2.5 hydra-core torch_geometric wandb moviepy matplotlib==3.8
- Install our versions of VMAS, TensorDict, and TorchRL:
git clone -b het_env_design https://github.com/proroklab/VectorizedMultiAgentSimulator.git
pip install -e VectorizedMultiAgentSimulator
git clone -b het_env_design https://github.com/matteobettini/tensordict.git
cd tensordict
python setup.py develop
cd ..
git clone -b het_env_design https://github.com/matteobettini/rl.git
cd rl
python setup.py develop
cd ..
- Install the optional logging dependencies:
pip install wandb moviepy
- Try running a script (if a GPU is available, it will be used):
python HetEnvDesign/matrix_game_cont.py
Each experiment in the paper has a corresponding Python file and a yaml configuration folder (under the conf folder) containing its hyperparameters.
Here is how to reproduce the experiments (any parameter in the corresponding configuration folder can be passed on the command line).
The -m flag performs a Hydra multirun; see the Hydra docs for more information.
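As a concrete illustration of what a multirun does, a quick Python sketch can count how many runs a sweep launches; the value lists below are copied from the matrix-game commands in this README:

```python
from itertools import product

# Sweep values taken from the matrix-game multirun commands in this README
agg_type_task = ["min", "mean", "max"]
agg_type_agent = ["min", "mean", "max"]
seeds = list(range(9))  # seed=0,...,8

# Hydra's -m flag launches one run per element of the Cartesian product
runs = list(product(agg_type_task, agg_type_agent, seeds))
print(len(runs))  # 3 * 3 * 9 = 81 runs in total
```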
This is the "matrix game" considered in our experiments section.
Continuous:
python HetEnvDesign/matrix_game_cont.py -m env.scenario.gen_agg_type_task=min,mean,max env.scenario.gen_agg_type_agent=min,mean,max seed=0,1,2,3,4,5,6,7,8
Discrete:
python HetEnvDesign/matrix_game_disc.py -m env.scenario.gen_agg_type_task=min,mean,max env.scenario.gen_agg_type_agent=min,mean,max seed=0,1,2,3,4,5,6,7,8
Once run, you can plot the results from wandb (this creates the pdf used in the paper):
python HetEnvDesign/plotting/plot_matrix_games.py
This is the embodied environment studied in our experiments section.
python HetEnvDesign/goal_navigation_vanilla.py -m env.scenario.gen_agg_type_task=min,mean,max env.scenario.gen_agg_type_agent=min,mean,max seed=0,1,2,3,4,5,6,7,8
Once run, you can plot the results from wandb (this creates the pdf used in the paper):
python HetEnvDesign/plotting/plot_ctf_embodied_vanilla.py
This runs the experiments where agents are equipped with a range sensor of increasing radius.
python HetEnvDesign/goal_navigation_lidar.py -m env.scenario.lidar_range=0,0.1,0.2,0.35 seed=0,1,2,3
Once run, you can plot the results from wandb (this creates the pdf used in the paper):
python HetEnvDesign/plotting/plot_ctf_embodied_lidar.py
These are the co-design HED experiments, where HED optimizes the reward structure to favour heterogeneity.
Softmax:
python HetEnvDesign/goal_navigation_hed_softmax.py -m seed=0,1,2,3,4,5,6,7,8,9,10,11,12
Power-Sum:
python HetEnvDesign/goal_navigation_hed_powersum.py -m seed=0,1,2,3,4,5,6,7,8,9,10,11,12
Once run, you can plot the results from wandb (this creates the pdfs used in the paper):
python HetEnvDesign/plotting/plot_ctf_embodied_softmax_design.py
python HetEnvDesign/plotting/plot_ctf_embodied_powersum_design.py
The configuration for the codebase is available in the HetEnvDesign/conf folder.
Each parameter can be changed in the yaml files or from the command line using the Hydra override syntax; see the Hydra docs for more information.
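As a hypothetical illustration (the actual files under HetEnvDesign/conf may be organized differently), a parameter that appears nested in a yaml file:

```yaml
# Hypothetical excerpt from a conf yaml file; layout shown for illustration only
env:
  scenario:
    gen_agg_type_task: mean   # default value set in the yaml file
```

can be overridden at launch time with the standard Hydra dotted syntax, e.g. python HetEnvDesign/matrix_game_cont.py env.scenario.gen_agg_type_task=max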