Adding torch accelerator to ddp-tutorial-series example #1376

dggaytan · 2025-07-21T20:41:49Z

Adding accelerator to ddp tutorials examples

Updated ddp_setup functions in multigpu.py, multigpu_torchrun.py, and multinode.py to use torch.accelerator for device management. The initialization of process groups now dynamically selects the backend based on the device type, with a fallback to CPU if no accelerator is available.
Modified Trainer classes in multigpu_torchrun.py and multinode.py to accept a device parameter and use it for model placement and snapshot loading.

Added run_example.sh to simplify running tutorial examples with configurable GPU counts and node settings.
Updated run_distributed_examples.sh to include a new function for running all DDP tutorial series examples.

Increased the minimum PyTorch version requirement in requirements.txt to 2.7 to ensure compatibility with the new torch.accelerator API.

Signed-off-by: dggaytan <[email protected]>

netlify · 2025-07-21T20:41:54Z

Name	Link
🔨 Latest commit	`cb48338`
🔍 Latest deploy log	https://app.netlify.com/projects/pytorch-examples-preview/deploys/687ea60fd2e52400086a7789

Adding torch accelerator to ddp-tutorial-series example

cb48338

Signed-off-by: dggaytan <[email protected]>

meta-cla bot added the cla signed label Jul 21, 2025

Provide feedback