Separate Multi-agent interfaces for different formalisms

I recently wrote a proposal about multiple agent problems, and it convinced me that we should have separate interfaces (and maybe separate abstract types) for different multi-agent formalisms.

A few factors supporting this conclusion are:

1. The extensive form game and stochastic/markov game formalisms are very different, and trying to accommodate all forms in one interface will be extremely complicated to document and error-prone.
2. [Petting Zoo](https://pettingzoo.farama.org/index.html) has settled on two different interfaces (AEC for turn-based and parallel for simultaneous)
3. Originally in my mind there was some question about whether we should have different abstract types for partial observability. This would complicate an approach that had multiple multi-agent interaction because there would be a matrix of different types, i.e. `AbstractMDPEnv`, `AbstractPOMDPEnv`, `AbstractStochasticGame`, `AbstractPOSG`, etc. I now think that interaction mode is the most important concept and interfaces should be separated based on that. Partial observability is secondary and can be handled as a trait.
4. Conversion between the different formalisms through wrapping is a good solution for compatibility and should be recommended. When the current interface was designed, I thought that wrappers were a bad thing and should be avoided. But, with `provided`, `AbstractWrapper`, `QuickWrapper`, and `AutomaticDefault`, wrapping works so well that I think it is a viable solution.

This is in some ways a reversal of #43, but I think `AbstractZeroSumEnv` would not have been a good choice. Whether a game is zero-sum should be a trait.

I don't have any plans to actually make this change at the moment - this is just a note for the future.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Separate Multi-agent interfaces for different formalisms #55

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Separate Multi-agent interfaces for different formalisms #55

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions