[docs] Parallel loading of shards #12135

stevhliu · 2025-08-13T03:40:17Z

Docs to go along with #12028.

I didn't mention #11904 since it doesn't appear to require any action on a user's behalf and it just works in the background. It's a cool design/implementation detail though and I think it'd be pretty interesting to maybe do a blog post about optimizations like this.

HuggingFaceDocBuilderDev · 2025-08-13T03:47:40Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul · 2025-08-13T03:50:53Z

docs/source/en/using-diffusers/loading.md

+pipeline = DiffusionPipeline.from_pretrained(
+    "Wan-AI/Wan2.2-I2V-A14B-Diffusers",
+    torch_dtype=torch.bfloat16,
+    device_map="cuda"


Do we want to talk a bit about device_map? The motivation behind passing device_map essentially comes from this PR: #11904. I have also tried providing a bit of motivation behind adding it at the pipeline-level here: #12122 (comment).

sayakpaul · 2025-08-14T02:12:30Z

> I didn't mention #11904 since it doesn't appear to require any action on a user's behalf and it just works in the background. It's a cool design/implementation detail though and I think it'd be pretty interesting to maybe do a blog post about optimizations like this.

@stevhliu it comes to fruition when we initialize the model directly on the accelerator device through device_map instead of doing to("cuda"). So, it could be important to mention I guess.
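
For reference, the two loading styles being contrasted can be sketched as follows. This is a minimal illustration rather than text from the PR: the function names `load_on_accelerator` and `load_then_move` are hypothetical, the model ID and arguments are copied from the diff excerpt above, and the imports sit inside the functions only so the snippet can be imported on a machine without a GPU or the model weights.

```python
MODEL_ID = "Wan-AI/Wan2.2-I2V-A14B-Diffusers"  # model ID taken from the diff excerpt


def load_on_accelerator(model_id: str = MODEL_ID):
    """Initialize the pipeline directly on the accelerator via device_map."""
    import torch
    from diffusers import DiffusionPipeline

    # Passing device_map at load time is the path the comment above says
    # lets the #11904 optimization take effect.
    return DiffusionPipeline.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,
        device_map="cuda",
    )


def load_then_move(model_id: str = MODEL_ID):
    """Load the pipeline first, then move it to the accelerator with .to()."""
    import torch
    from diffusers import DiffusionPipeline

    pipeline = DiffusionPipeline.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,
    )
    # The older pattern: weights are loaded without a device_map and moved afterwards.
    return pipeline.to("cuda")
```

Calling either function downloads the full checkpoint and needs a CUDA device, so treat this as a sketch of the API shape and not as a benchmark of the two paths.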

initial

b0dd3b7

stevhliu requested a review from sayakpaul August 13, 2025 03:47

sayakpaul reviewed Aug 13, 2025

stevhliu and others added 2 commits August 13, 2025 10:52

feedback

e06b21f

Merge branch 'main' into parallel-loading

9a9fd95

sayakpaul approved these changes Aug 14, 2025

Merge branch 'main' into parallel-loading

2f42f08

sayakpaul approved these changes Aug 14, 2025

Update docs/source/en/using-diffusers/loading.md

ca661ef

sayakpaul merged commit 421ee07 into huggingface:main Aug 14, 2025
1 check passed

stevhliu deleted the parallel-loading branch August 14, 2025 16:47
