Support unittest for Z-image ⚡️ #12715

JerryWu-code · 2025-11-26T03:59:28Z

What does this PR do?

This PR adds unittest for Z-image Series⚡️ as discussed in #12703 (comment). Z-Image-Turbo, the distillation variant of our Z-Image, could generate 1K resolution photorealistic photo while excels in complex en/zh text rendering within 1-second in H800/H100 cards in bf16-precision.

Z-Image is a powerful and highly efficient 6B-parameter image generation model that is friendly for consumer-grade hardware, with strong capabilities in photorealistic image generation, accurate rendering of both complex Chinese and English text, and robust adherence to bilingual instructions.

The technical report and Z-Image-Turbo checkpoint will be released very soon !!!

Thanks for the support of @yiyixuxu.

Fixes # (issue)

Fix bugs for num_images_per_prompt with actual batch.
Refine unitest and skip for cases needed separate test env; Fix compatibility with unitest in model, mostly precision formating.
Add clean environment for test_save_load_float16 separate environment test and add notes for that; Styling.
Merge remote main branch for easy integration.

…, Remove once func in pipeline.

…ryWu-code/z-image # Conflicts: # src/diffusers/models/transformers/transformer_z_image.py

…peat; Add hint for attn processor.

…ace its origin implement; Add DocString in pipeline for that.

…rd, replace its origin implement; Add DocString in pipeline for that." This reverts commit fbf26b7.

…al commit for fa3 compatibility.

… pre-encode as List of torch Tensor.

…tibility with unitest in model, mostly precision formating.

HuggingFaceDocBuilderDev · 2025-11-26T04:59:24Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yiyixuxu

thanks so much for the PR! I left some small suggestions:)
let me know what you think

yiyixuxu · 2025-11-26T06:13:50Z

src/diffusers/models/transformers/transformer_z_image.py

        t_freq = self.timestep_embedding(t, self.frequency_embedding_size)
-        t_emb = self.mlp(t_freq.to(self.mlp[0].weight.dtype))
+        weight_dtype = self.mlp[0].weight.dtype
+        if weight_dtype in [torch.float32, torch.float16, torch.bfloat16]:


Suggested change

if weight_dtype in [torch.float32, torch.float16, torch.bfloat16]:

if weight_dtype.is_floating_point:

Sure, initially change this for compatible with precision autocasting, but yeah "is_floating_point" works ~

yiyixuxu · 2025-11-26T06:20:32Z

src/diffusers/pipelines/z_image/pipeline_z_image.py

-        assert self.dtype == torch.bfloat16
-        dtype = self.dtype
+        # assert self.dtype == torch.bfloat16
+        dtype = self.dtype if hasattr(self, "dtype") and self.dtype is not None else torch.float32


we usually don't use self.dtype , this is the logic behind it https://github.com/huggingface/diffusers/blob/main/src/diffusers/pipelines/pipeline_utils.py#L578

you can see that it is not very useful when components can sometimes have different dtype
so instead, we try to use specific dtype at each step, e.g. you will see a lot of patterns like this
https://github.com/huggingface/diffusers/blob/main/src/diffusers/pipelines/flux/pipeline_flux.py#L257

prompt_embeds = encode_prompt(..., dtype= self.text_encoder.dtype)

or

latents = prepare_latents(..., dtype=torch.float32)

Perfect !! 😊 We've changed to trying to get dtype of the components of pipeline as your mentioned in the first format. Already styling may ready to ort merge ❤️

JerryWu-code · 2025-11-26T09:35:58Z

Hi yiyi, this commit e277137 is ready to merged !! 🚀🚀🚀 Thanks 😊

JerryWu-code and others added 30 commits November 23, 2025 19:54

Add Support for Z-Image.

42658fa

Reformatting with make style, black & isort.

3e74bb2

Remove init, Modify import utils, Merge forward in transformers block…

a4b89a0

…, Remove once func in pipeline.

modified main model forward, freqs_cis left

7df350d

Merge remote-tracking branch 'JerryWu-code/z-image-dev' into fork/Jer…

1dd587b

…ryWu-code/z-image # Conflicts: # src/diffusers/models/transformers/transformer_z_image.py

refactored to add B dim

aae03cf

fixed stack issue

21d8130

fixed modulation bug

e3dfa9e

fixed modulation bug

a7fa731

fix bug

1e0cefe

remove value_from_time_aware_config

7adaae8

styling

5b4c907

Fix neg embed and devide / bug; Reuse pad zero tensor; Turn cat -> re…

2bb39f4

…peat; Add hint for attn processor.

Replace padding with pad_sequence; Add gradient checkpointing.

71e8049

Fix flash_attn3 in dispatch attn backend by _flash_attn_forward, repl…

fbf26b7

…ace its origin implement; Add DocString in pipeline for that.

Fix Docstring and Make Style.

6c0c059

Revert "Fix flash_attn3 in dispatch attn backend by _flash_attn_forwa…

28685dd

…rd, replace its origin implement; Add DocString in pipeline for that." This reverts commit fbf26b7.

update z-image docstring

8e391b7

Revert attention dispatcher

3b22e84

update z-image docstring

3d1a7aa

styling

336c5ce

Recover attention_dispatch.py with its origin impl, later would speci…

38a89ed

…al commit for fa3 compatibility.

Fix prev bug, and support for prompt_embeds pass in args after prompt…

69d61e5

… pre-encode as List of torch Tensor.

Merge branch 'z-image-dev-ql' into z-image-dev

549ad57

Remove einop dependency.

1dd8f3c

Merge branch 'z-image-dev' into z-image

2f2d8c3

Merge remote-tracking branch 'origin/main' into z-image

a74a0c4

remove redundant imports & make fix-copies

e49a1f9

fix import

1048d0a

Support for num_images_per_prompt>1; Remove redundant unquote variables.

266e169

JerryWu-code added 6 commits November 25, 2025 21:09

Fix bugs for num_images_per_prompt with actual batch.

12d2fb2

Add unit tests for Z-Image.

9a049f0

Refine unitest and skip for cases needed separate test env; Fix compa…

c4e4a57

…tibility with unitest in model, mostly precision formating.

Add clean env for test_save_load_float16 separ test; Add Note; Styling.

6f2808b

Merge current branch into ours for next pr compatibility.

e48060c

Merge branch 'main' into z-image

27a37cd

JerryWu-code mentioned this pull request Nov 26, 2025

Add Support for Z-Image Series #12703

Merged

yiyixuxu reviewed Nov 26, 2025

View reviewed changes

JerryWu-code added 2 commits November 26, 2025 09:15

Update dtype mentioned by yiyi.

aeed890

Merge branch 'main' into z-image

e277137

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support unittest for Z-image ⚡️ #12715

Support unittest for Z-image ⚡️ #12715

JerryWu-code commented Nov 26, 2025 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Nov 26, 2025

Uh oh!

yiyixuxu left a comment

Uh oh!

yiyixuxu Nov 26, 2025

Uh oh!

JerryWu-code Nov 26, 2025

Uh oh!

yiyixuxu Nov 26, 2025

Uh oh!

JerryWu-code Nov 26, 2025

Uh oh!

JerryWu-code commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	if weight_dtype in [torch.float32, torch.float16, torch.bfloat16]:
	if weight_dtype.is_floating_point:

Support unittest for Z-image ⚡️ #12715

Are you sure you want to change the base?

Support unittest for Z-image ⚡️ #12715

Conversation

JerryWu-code commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Nov 26, 2025

Uh oh!

yiyixuxu left a comment

Choose a reason for hiding this comment

Uh oh!

yiyixuxu Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

JerryWu-code Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

yiyixuxu Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

JerryWu-code Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

JerryWu-code commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

JerryWu-code commented Nov 26, 2025 •

edited

Loading