Port distributed training support from existing PR #1315

@lrzpellegrini

I'm opening this issue to keep track of the work needed to port the content of PR #996 to the main branch.

The idea is to split that PR (which is huge and based on a fairly old version of the codebase) and, starting from the current state of the main branch, port its main elements in smaller PRs.
I'll keep this issue updated as I work on this.

Many changes are not strictly related to supporting distributed training but may benefit Avalanche in general.

  1. I'm starting with porting the modernized object detection/segmentation dataset, strategies, and metrics. I'll also port the generalized batch collate functionality.

Changes in Distributed Training PR #996:

Legend:

  • 🔲 Not ported
  • ⌛ Work in progress
  • 💬 PR opened, discussion in progress
  • ✔️ Merged into main branch

Base elements

Strategy and plugins

Models

Detection

Data Loader

Loggers and metrics

Unit tests

Typing
