[WIP] SVDQuant #11950

DerekLiu35 · 2025-07-17T21:10:52Z

What does this PR do?

Right now, this only loads a pre-quantized Nunchaku model.

# INT-4 SVDQuant
from diffusers import FluxPipeline, FluxTransformer2DModel
from diffusers.quantizers.quantization_config import SVDQuantConfig
import torch

ckpt_id   = "black-forest-labs/FLUX.1-dev"
quant_id  = "mit-han-lab/svdq-int4-flux.1-dev"

transformer = FluxTransformer2DModel.from_single_file(
    quant_id,
    quantization_config=SVDQuantConfig(),
    torch_dtype=torch.bfloat16,
    device_map="cuda",
)

pipe = FluxPipeline.from_pretrained(
    ckpt_id,
    transformer=transformer,
    torch_dtype=torch.bfloat16,
).to("cuda")

pipe_kwargs = {
    "prompt": "A cat holding a sign that says hello world",
    "height": 1024,
    "width": 1024,
    "guidance_scale": 3.5,
    "num_inference_steps": 50,
}
image = pipe(generator=torch.manual_seed(0), **pipe_kwargs).images[0]
image.save("svdq_int4.png")


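# BF16 baseline
# Free the quantized pipeline first so both models do not have to fit in GPU memory at once.
del pipe, transformer
torch.cuda.empty_cache()

pipe = FluxPipeline.from_pretrained(
    ckpt_id,
    torch_dtype=torch.bfloat16,
).to("cuda")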

image = pipe(generator=torch.manual_seed(0), **pipe_kwargs).images[0]
image.save("bf16.png")
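
Since both runs use the same seed and settings, the two saved images can be compared numerically as well as by eye. The following is a minimal sketch, not part of this PR: `psnr` and `psnr_from_files` are hypothetical helper names, and PSNR is only one rough proxy for how closely the INT4 output tracks the BF16 baseline. It assumes Pillow is installed for the file-based helper.

```python
import math


def psnr(a, b, max_val=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length sequences of pixel values."""
    if len(a) != len(b):
        raise ValueError("inputs must contain the same number of values")
    if len(a) == 0:
        raise ValueError("inputs must not be empty")
    # Mean squared error over all pixel values.
    mse = sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)
    if mse == 0:
        return math.inf  # identical inputs
    return 10.0 * math.log10(max_val ** 2 / mse)


def psnr_from_files(path_a, path_b):
    """Hypothetical helper: load two images as 8-bit RGB and compute their PSNR."""
    from PIL import Image  # Pillow, imported lazily so psnr() itself has no dependencies

    img_a = Image.open(path_a).convert("RGB")
    img_b = Image.open(path_b).convert("RGB")
    if img_a.size != img_b.size:
        raise ValueError("images must have the same dimensions")
    # tobytes() yields one integer in [0, 255] per channel value.
    return psnr(img_a.tobytes(), img_b.tobytes())


# Example call, assuming both files from the snippet above exist:
# print(psnr_from_files("svdq_int4.png", "bf16.png"))
```

A higher value means the quantized output is closer to the baseline. Because diffusion sampling can amplify small numerical differences over many steps, a modest PSNR does not by itself indicate a visibly worse image.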

HuggingFaceDocBuilderDev · 2025-07-17T21:18:02Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

init

b5bf4cf

fix style

1a1857c
