No reward in Meltingpot tasks

Hi! Thanks for the amazing work. I'm having trouble running meltingpot tasks, for example:
```
HYDRA_FULL_ERROR=1 uv run thirdparty/BenchMARL/benchmarl/run.py -m \
algorithm=mappo \
task=meltingpot/collaborative_cooking__crowded \
model=sequence \
"model.intermediate_sizes=[256]" \
"model/layers@model.layers.l1=cnn" \
"model/layers@model.layers.l2=mlp" \
model@critic_model=sequence \
"critic_model.intermediate_sizes=[256]" \
"model/layers@critic_model.layers.l1=cnn" \
"model/layers@critic_model.layers.l2=mlp" \
seed=0 \
experiment.save_folder=./outputs \
experiment.checkpoint_interval=60000 \
experiment.checkpoint_at_end=True \
experiment.sampling_device="cpu" \
experiment.train_device="cpu" \
experiment.buffer_device="cpu" \
experiment.loggers=[csv] 
```
But the rewards from the environment seems to be stuck at 0 (even after training several hours):

![Image](https://github.com/user-attachments/assets/19ee46b5-8a42-4b53-ac01-32a391b733f6)

I get rewards for other tasks (vmas, etc), I bet something is wrong with my command, maybe the cnn setup, but I can't figure out what it is. Any help would be greatly appreciated ;)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

No reward in Meltingpot tasks #196

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

No reward in Meltingpot tasks #196

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions