Merged
4 changes: 4 additions & 0 deletions docs/diffusers/_toctree.yml
@@ -232,6 +232,8 @@
  title: PriorTransformer
- local: api/models/sd3_transformer2d
  title: SD3Transformer2DModel
- local: api/models/skyreels_v2_transformer_3d
  title: SkyReelsV2Transformer3DModel
- local: api/models/sana_transformer2d
  title: SanaTransformer2DModel
- local: api/models/stable_audio_transformer
@@ -420,6 +422,8 @@
  title: Semantic Guidance
- local: api/pipelines/shap_e
  title: Shap-E
- local: api/pipelines/skyreels_v2
  title: SkyReels-V2
- local: api/pipelines/stable_audio
  title: Stable Audio
- local: api/pipelines/stable_cascade
8 changes: 7 additions & 1 deletion docs/diffusers/api/loaders/lora.md
@@ -24,7 +24,9 @@ LoRA is a fast and lightweight training method that inserts and trains a signifi
- `SanaLoraLoaderMixin` provides similar functions for [Sana](../../api/pipelines/sana.md).
- `HunyuanVideoLoraLoaderMixin` provides similar functions for [HunyuanVideo](../../api/pipelines/hunyuan_video.md).
- `Lumina2LoraLoaderMixin` provides similar functions for [Lumina2](../../api/pipelines/lumina2.md).
- `AmusedLoraLoaderMixin` is for the [`AmusedPipeline`].
- `WanLoraLoaderMixin` provides similar functions for [Wan](../../api/pipelines/wan.md).
- `SkyReelsV2LoraLoaderMixin` provides similar functions for [SkyReels-V2](../../api/pipelines/skyreels_v2.md).
- `AmusedLoraLoaderMixin` is for the [AmusedPipeline](../../api/pipelines/amused.md).
- `LoraBaseMixin` provides a base class with several utility methods to fuse, unfuse, unload, LoRAs and more.

!!! tip
@@ -52,6 +54,10 @@ LoRA is a fast and lightweight training method that inserts and trains a signifi

::: mindone.diffusers.loaders.lora_pipeline.Lumina2LoraLoaderMixin

::: mindone.diffusers.loaders.lora_pipeline.WanLoraLoaderMixin

::: mindone.diffusers.loaders.lora_pipeline.SkyReelsV2LoraLoaderMixin

::: mindone.diffusers.loaders.lora_pipeline.AmusedLoraLoaderMixin

::: mindone.diffusers.loaders.lora_base.LoraBaseMixin
26 changes: 26 additions & 0 deletions docs/diffusers/api/models/skyreels_v2_transformer_3d.md
@@ -0,0 +1,26 @@
<!-- Copyright 2024 The HuggingFace Team. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
specific language governing permissions and limitations under the License. -->

# SkyReelsV2Transformer3DModel

A Diffusion Transformer model for 3D video-like data, introduced in [SkyReels-V2](https://github.com/SkyworkAI/SkyReels-V2) by Skywork AI.

The model can be loaded with the following code snippet.

```python
import mindspore as ms
from mindone.diffusers import SkyReelsV2Transformer3DModel

# Load only the transformer weights from the pipeline repository, in bfloat16.
transformer = SkyReelsV2Transformer3DModel.from_pretrained("Skywork/SkyReels-V2-DF-1.3B-540P-Diffusers", subfolder="transformer", mindspore_dtype=ms.bfloat16)
```

::: mindone.diffusers.SkyReelsV2Transformer3DModel

::: mindone.diffusers.models.modeling_outputs.Transformer2DModelOutput
303 changes: 303 additions & 0 deletions docs/diffusers/api/pipelines/skyreels_v2.md

Large diffs are not rendered by default.

12 changes: 12 additions & 0 deletions mindone/diffusers/__init__.py
@@ -70,6 +70,7 @@
"SD3ControlNetModel",
"SD3MultiControlNetModel",
"SD3Transformer2DModel",
"SkyReelsV2Transformer3DModel",
"SparseControlNetModel",
"StableAudioDiTModel",
"StableCascadeUNet",
@@ -219,6 +220,11 @@
"SemanticStableDiffusionPipeline",
"ShapEImg2ImgPipeline",
"ShapEPipeline",
"SkyReelsV2DiffusionForcingImageToVideoPipeline",
"SkyReelsV2DiffusionForcingPipeline",
"SkyReelsV2DiffusionForcingVideoToVideoPipeline",
"SkyReelsV2ImageToVideoPipeline",
"SkyReelsV2Pipeline",
"StableAudioPipeline",
"StableAudioProjectionModel",
"StableCascadeCombinedPipeline",
@@ -399,6 +405,7 @@
SD3ControlNetModel,
SD3MultiControlNetModel,
SD3Transformer2DModel,
SkyReelsV2Transformer3DModel,
SparseControlNetModel,
StableAudioDiTModel,
StableCascadeUNet,
@@ -547,6 +554,11 @@
SemanticStableDiffusionPipeline,
ShapEImg2ImgPipeline,
ShapEPipeline,
SkyReelsV2DiffusionForcingImageToVideoPipeline,
SkyReelsV2DiffusionForcingPipeline,
SkyReelsV2DiffusionForcingVideoToVideoPipeline,
SkyReelsV2ImageToVideoPipeline,
SkyReelsV2Pipeline,
StableAudioPipeline,
StableAudioProjectionModel,
StableCascadeCombinedPipeline,
2 changes: 2 additions & 0 deletions mindone/diffusers/loaders/__init__.py
@@ -76,6 +76,7 @@ def text_encoder_attn_modules(text_encoder):
"Lumina2LoraLoaderMixin",
"WanLoraLoaderMixin",
"HiDreamImageLoraLoaderMixin",
"SkyReelsV2LoraLoaderMixin",
],
"peft": ["PeftAdapterMixin"],
"single_file": ["FromSingleFileMixin"],
@@ -100,6 +101,7 @@ def text_encoder_attn_modules(text_encoder):
Mochi1LoraLoaderMixin,
SanaLoraLoaderMixin,
SD3LoraLoaderMixin,
SkyReelsV2LoraLoaderMixin,
StableDiffusionLoraLoaderMixin,
StableDiffusionXLLoraLoaderMixin,
WanLoraLoaderMixin,