Skip to content

No reward in Meltingpot tasks #196

@simonpedrogonzalez

Description

@simonpedrogonzalez

Hi! Thanks for the amazing work. I'm having trouble running meltingpot tasks, for example:

HYDRA_FULL_ERROR=1 uv run thirdparty/BenchMARL/benchmarl/run.py -m \
algorithm=mappo \
task=meltingpot/collaborative_cooking__crowded \
model=sequence \
"model.intermediate_sizes=[256]" \
"model/[email protected]=cnn" \
"model/[email protected]=mlp" \
model@critic_model=sequence \
"critic_model.intermediate_sizes=[256]" \
"model/layers@critic_model.layers.l1=cnn" \
"model/layers@critic_model.layers.l2=mlp" \
seed=0 \
experiment.save_folder=./outputs \
experiment.checkpoint_interval=60000 \
experiment.checkpoint_at_end=True \
experiment.sampling_device="cpu" \
experiment.train_device="cpu" \
experiment.buffer_device="cpu" \
experiment.loggers=[csv] 

But the rewards from the environment seems to be stuck at 0 (even after training several hours):

Image

I get rewards for other tasks (vmas, etc), I bet something is wrong with my command, maybe the cnn setup, but I can't figure out what it is. Any help would be greatly appreciated ;)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions