[Triton] 355 wip fused triton #666

k50112113 · 2025-09-10T17:41:58Z

This PR contains launch-bound triton fusion kernel optimization,

including DS-V3 FP8, LL-70B (rope+kv_cache), gpt-oss-120B (rope+kv_cahce),

for GPT-OSS, the following triton compiler change is required:

Change TRITON_HIP_PRESHUFFLE_SCALES default by cagrikymk · Pull Request #877 · ROCm/triton

k50112113 · 2025-09-10T17:42:15Z

See ROCm/aiter#984

…ant + FP8 batched GEMM

…_FP8_QUANT to DS-V3 top-level

k50112113 mentioned this pull request Sep 10, 2025

[Triton] 355 wip fused triton ROCm/aiter#984

Merged

dllehr-amd force-pushed the 355_wip branch from 88f141e to 0e3f0fc Compare September 10, 2025 20:23

dllehr-amd requested a review from gshtras as a code owner September 10, 2025 20:23

dllehr-amd self-requested a review September 10, 2025 20:30

dllehr-amd approved these changes Sep 10, 2025

View reviewed changes

k50112113 and others added 14 commits September 10, 2025 21:01

integrate fused_rms_fp8_group_quant and fused_mul_add for DS-V3

03402d1

add fused rope + zeros + reshape_and_cache and FP8 per-token group qu…

4bf1a0c

…ant + FP8 batched GEMM

add env var for aiter/triton fp8 gemm switch

517f4d4

add fused silu mul quant and post_attention rms quant

e132973

workaround

9d73620

add fused kv cache for gpt-oss

ee3553e

remove fused split rope

32bbe45

set env var default to 1 and move VLLM_ROCM_USE_AITER_TRITON_SILU_MUL…

00b1b6d

…_FP8_QUANT to DS-V3 top-level

change env var name

500f3c2

fix bug

1f4d389

update GPT-OSS related env variables

4335d8a

change default for hip preshuffle

e19c5b1

clean up, remove comments

aa71cd2

fix bug, clean up

b6415ea

k50112113 force-pushed the 355_wip_fused_triton branch from 3f0d1a2 to b6415ea Compare September 10, 2025 23:22

dllehr-amd approved these changes Sep 11, 2025

View reviewed changes

dllehr-amd merged commit b7a1826 into 355_wip Sep 11, 2025
3 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Triton] 355 wip fused triton #666

[Triton] 355 wip fused triton #666

Uh oh!

k50112113 commented Sep 10, 2025 •

edited by github-actions bot

Loading

Uh oh!

k50112113 commented Sep 10, 2025

Uh oh!

Uh oh!

Uh oh!

[Triton] 355 wip fused triton #666

[Triton] 355 wip fused triton #666

Uh oh!

Conversation

k50112113 commented Sep 10, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

k50112113 commented Sep 10, 2025

Uh oh!

Uh oh!

Uh oh!

k50112113 commented Sep 10, 2025 •

edited by github-actions bot

Loading