Skip to content

Commit d64520e

Browse files
committed
feat(diffusers/pipelines): add pipelines of skyreels_v2 in diffusers master
1 parent 46ff896 commit d64520e

28 files changed

+6942
-2
lines changed

docs/diffusers/_toctree.yml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -232,6 +232,8 @@
232232
title: PriorTransformer
233233
- local: api/models/sd3_transformer2d
234234
title: SD3Transformer2DModel
235+
- local: api/models/skyreels_v2_transformer_3d
236+
title: SkyReelsV2Transformer3DModel
235237
- local: api/models/sana_transformer2d
236238
title: SanaTransformer2DModel
237239
- local: api/models/stable_audio_transformer
@@ -420,6 +422,8 @@
420422
title: Semantic Guidance
421423
- local: api/pipelines/shap_e
422424
title: Shap-E
425+
- local: api/pipelines/skyreels_v2
426+
title: SkyReels-V2
423427
- local: api/pipelines/stable_audio
424428
title: Stable Audio
425429
- local: api/pipelines/stable_cascade

docs/diffusers/api/loaders/lora.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,9 @@ LoRA is a fast and lightweight training method that inserts and trains a signifi
2424
- `SanaLoraLoaderMixin` provides similar functions for [Sana](../../api/pipelines/sana.md).
2525
- `HunyuanVideoLoraLoaderMixin` provides similar functions for [HunyuanVideo](../../api/pipelines/hunyuan_video.md).
2626
- `Lumina2LoraLoaderMixin` provides similar functions for [Lumina2](../../api/pipelines/lumina2.md).
27-
- `AmusedLoraLoaderMixin` is for the [`AmusedPipeline`].
27+
- `WanLoraLoaderMixin` provides similar functions for [Wan](../../api/pipelines/wan.md).
28+
- `SkyReelsV2LoraLoaderMixin` provides similar functions for [SkyReels-V2](../../api/pipelines/skyreels_v2.md).
29+
- `AmusedLoraLoaderMixin` is for the [AmusedPipeline](../../api/pipelines/amused.md).
2830
- `LoraBaseMixin` provides a base class with several utility methods to fuse, unfuse, unload, LoRAs and more.
2931

3032
!!! tip
@@ -52,6 +54,10 @@ LoRA is a fast and lightweight training method that inserts and trains a signifi
5254

5355
::: mindone.diffusers.loaders.lora_pipeline.Lumina2LoraLoaderMixin
5456

57+
::: mindone.diffusers.loaders.lora_pipeline.WanLoraLoaderMixin
58+
59+
::: mindone.diffusers.loaders.lora_pipeline.SkyReelsV2LoraLoaderMixin
60+
5561
::: mindone.diffusers.loaders.lora_pipeline.AmusedLoraLoaderMixin
5662

5763
::: mindone.diffusers.loaders.lora_base.LoraBaseMixin
Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
<!-- Copyright 2024 The HuggingFace Team. All rights reserved.
2+
3+
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
4+
the License. You may obtain a copy of the License at
5+
6+
http://www.apache.org/licenses/LICENSE-2.0
7+
8+
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
9+
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
10+
specific language governing permissions and limitations under the License. -->
11+
12+
# SkyReelsV2Transformer3DModel
13+
14+
A Diffusion Transformer model for 3D video-like data was introduced in [SkyReels-V2](https://github.com/SkyworkAI/SkyReels-V2) by the Skywork AI.
15+
16+
The model can be loaded with the following code snippet.
17+
18+
```python
19+
from mindone.diffusers import SkyReelsV2Transformer3DModel
20+
21+
transformer = SkyReelsV2Transformer3DModel.from_pretrained("Skywork/SkyReels-V2-DF-1.3B-540P-Diffusers", subfolder="transformer", mindspore_dtype=ms.bfloat16)
22+
```
23+
24+
::: mindone.diffusers.SkyReelsV2Transformer3DModel
25+
26+
::: mindone.diffusers.models.modeling_outputs.Transformer2DModelOutput

docs/diffusers/api/pipelines/skyreels_v2.md

Lines changed: 303 additions & 0 deletions
Large diffs are not rendered by default.

mindone/diffusers/__init__.py

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -70,6 +70,7 @@
7070
"SD3ControlNetModel",
7171
"SD3MultiControlNetModel",
7272
"SD3Transformer2DModel",
73+
"SkyReelsV2Transformer3DModel",
7374
"SparseControlNetModel",
7475
"StableAudioDiTModel",
7576
"StableCascadeUNet",
@@ -219,6 +220,11 @@
219220
"SemanticStableDiffusionPipeline",
220221
"ShapEImg2ImgPipeline",
221222
"ShapEPipeline",
223+
"SkyReelsV2DiffusionForcingImageToVideoPipeline",
224+
"SkyReelsV2DiffusionForcingPipeline",
225+
"SkyReelsV2DiffusionForcingVideoToVideoPipeline",
226+
"SkyReelsV2ImageToVideoPipeline",
227+
"SkyReelsV2Pipeline",
222228
"StableAudioPipeline",
223229
"StableAudioProjectionModel",
224230
"StableCascadeCombinedPipeline",
@@ -399,6 +405,7 @@
399405
SD3ControlNetModel,
400406
SD3MultiControlNetModel,
401407
SD3Transformer2DModel,
408+
SkyReelsV2Transformer3DModel,
402409
SparseControlNetModel,
403410
StableAudioDiTModel,
404411
StableCascadeUNet,
@@ -547,6 +554,11 @@
547554
SemanticStableDiffusionPipeline,
548555
ShapEImg2ImgPipeline,
549556
ShapEPipeline,
557+
SkyReelsV2DiffusionForcingImageToVideoPipeline,
558+
SkyReelsV2DiffusionForcingPipeline,
559+
SkyReelsV2DiffusionForcingVideoToVideoPipeline,
560+
SkyReelsV2ImageToVideoPipeline,
561+
SkyReelsV2Pipeline,
550562
StableAudioPipeline,
551563
StableAudioProjectionModel,
552564
StableCascadeCombinedPipeline,

mindone/diffusers/loaders/__init__.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -76,6 +76,7 @@ def text_encoder_attn_modules(text_encoder):
7676
"Lumina2LoraLoaderMixin",
7777
"WanLoraLoaderMixin",
7878
"HiDreamImageLoraLoaderMixin",
79+
"SkyReelsV2LoraLoaderMixin",
7980
],
8081
"peft": ["PeftAdapterMixin"],
8182
"single_file": ["FromSingleFileMixin"],
@@ -100,6 +101,7 @@ def text_encoder_attn_modules(text_encoder):
100101
Mochi1LoraLoaderMixin,
101102
SanaLoraLoaderMixin,
102103
SD3LoraLoaderMixin,
104+
SkyReelsV2LoraLoaderMixin,
103105
StableDiffusionLoraLoaderMixin,
104106
StableDiffusionXLLoraLoaderMixin,
105107
WanLoraLoaderMixin,

0 commit comments

Comments
 (0)