Updated README.md for August 12 RC2 throughput results only #631

Mcirino1 · 2025-08-13T17:27:59Z

Waiting on latency results

Please direct your PRs to the upstream vllm (https://github.com/vllm-project/vllm.git)

Accepting PRs into the ROCm fork (https://github.com/ROCm/vllm) will require a clear previously communicated exception

* Optimization for quantized gemm skinny sizes
* lint fix
* Add support for bf16/fp16
* code cleanup
* code cleanup
* lint fix2
* cleanup
* Moved the logic into tuned gemm to preserve API compatibility
---------
Co-authored-by: Gregory Shtrasberg <[email protected]>
Co-authored-by: Gregory Shtrasberg <[email protected]>

* Update test-pipeline.yaml
  Disabling the "Tensorizer Test". The test is seen to generate exceptions while still reporting as successful. That needs to be verified before re-enabling the test in the production environment.
  Signed-off-by: Alexei V. Ivanov <[email protected]>
* Fixing pre-commit complaints.
  Signed-off-by: Alexei V. Ivanov <[email protected]>
* .
  Signed-off-by: Alexei V. Ivanov <[email protected]>
---------
Signed-off-by: Alexei V. Ivanov <[email protected]>

gshtras · 2025-08-13T17:36:19Z

docs/dev-docker/README.md

    cd vllm
-    git checkout b432b7a285aa0dcb9677380936ffa74931bb6d6f
+    git checkout 340ea86dfe5955d6f9a9e767d6abab5aacf2c978
    docker build -f docker/Dockerfile.rocm -t <your_tag> --build-arg USE_CYTHON=1 .


This --build-arg USE_CYTHON=1 part needs to be removed; it should have been dropped a few releases back.

Removed it for both build commands, but let me know if we still need it for the other one. Thanks!

The way it is now is correct, thank you

The merge-base changed after approval.

Isotr0py and others added 30 commits February 16, 2025 14:28

avoid calling hf_list_repo_files for local model

ccaff7f

Signed-off-by: isotr0py <[email protected]>

annotation

7cc05dd

Signed-off-by: isotr0py <[email protected]>

Merge remote-tracking branch 'upstream/main' into upstream_merge_25_02_17

ce342c7

Merge remote-tracking branch 'Isotr0py/local-lookup' into upstream_merge_25_02_17

669fc3f

Merge pull request ROCm#430 from ROCm/upstream_merge_25_02_17

365687d

Upstream merge 25 02 17

Updating PR template to point people to the upstream repo. Updating codeowners (ROCm#431)

4fd2f5b

Enabling the ROCm-vLLM CI on MI250 machines (ROCm#432)

17b26bd

* Enabling ROCm CI on MI250 machines:
  - correct build target
  - correct queue
  Signed-off-by: Alexei V. Ivanov <[email protected]>
---------
Signed-off-by: Alexei V. Ivanov <[email protected]>

Restricting FP8 wvSplitk to MI300x (ROCm#439)

b63a984

Remove mi300a (ROCm#440)

39456f3

* Removing gfx940 and gfx941 targets. These have been deprecated in favor of gfx942 for MI300X
  Signed-off-by: Gregory Shtrasberg <[email protected]>
* Remove from custom kernels as well
---------
Signed-off-by: Gregory Shtrasberg <[email protected]>

resolve diff for mixtral8x7B configs (ROCm#437)

5a6afcc

Signed-off-by: Divakar Verma <[email protected]>

Torch version bump to fix tunable ops (ROCm#442)

ff13c7a

* Advance torch commit to be past pytorch/pytorch#144942 to fix tunable ops
* Make sure to use the submodule commit compatible with the main aiter commit

Using AITER branch with fixed whl. Disabling PREBUILD_KERNELS until it is fixed (ROCm#443)

cea7419

Bump hipblaslt version. Minor fixes to printing the versions (ROCm#447)

118296d

Bumping the version in the right place (ROCm#448)

18689d8

init

07336d2

Signed-off-by: Sage Moore <[email protected]>

init

c226a30

Signed-off-by: Sage Moore <[email protected]>

update logs

ae3594e

Signed-off-by: Sage Moore <[email protected]>

Merge remote-tracking branch 'upstream/main' into upstream_merge_25_02_24

92a2279

Merge remote-tracking branch 'nm/sage/deepseek-rocm-fix' into upstream_merge_25_02_24

8230388

Merge branch 'main' into upstream_merge_25_02_24

d619b41

Fix test that was missed by local linters

46c1c97

Merge pull request ROCm#449 from ROCm/upstream_merge_25_02_24

ba6f019

Upstream merge 25 02 24

Stable aiter build (ROCm#450)

b5a4a37

* Using aiter branch that can be built into a whl with PREBUILD_KERNELS=1
* Using fail fast on aiter build to see compilation errors in the log since it fails silently
* Check for build success without installing whl

Remove batch padding on ROCm (ROCm#451)

f932181

Aiter whl fix branch (ROCm#452)

386763c

* Using proposed fix from ROCm/aiter#115
* Build fix

tuning adjustment for quantized skinny gemm. (ROCm#444)

fd70f59

* tuning adjustment for quantized skinny gemm.
* lint fix

Merge remote-tracking branch 'upstream/main' into upstream_merge_25_03_03

24c6283

Revert "[core] Perf improvement for DSv3 on AMD GPUs (vllm-project#13718)"

87bf00a

This reverts commit 8294773.

using list for typing

7cd9ea1

gshtras and others added 16 commits June 23, 2025 12:40

Merge pull request ROCm#581 from ROCm/upstream_merge_2025_06_23

c4258f4

Upstream merge 2025 06 23

Merge remote-tracking branch 'upstream/main'

52741bd

Merge remote-tracking branch 'hyoon1/remove_unused_var'

4ed2d76

Merge pull request ROCm#583 from ROCm/upstream_merge_2025_06_25

1f85814

Upstream merge 2025 06 25

Merge remote-tracking branch 'upstream/main'

d171777

Merge pull request ROCm#586 from ROCm/upstream_merge_2025_06_30

0f7ec48

Upstream merge 2025 06 30

Updated README.md for June 24 Docker release (ROCm#589)

5486e7b

* Updated README.md for June 24 Docker release
* Added additional throughput results
* Fixed some throughput results

Minor changes to command line examples (ROCm#594)

f94ec9b

* Minor changes to command line examples
* README changes and added throughput results
  Still waiting on latency
* Added latency results
* Update README.md
* Update README.md

Merge remote-tracking branch 'upstream/main'

753b68c

Revert "[AMD][CI/Build] Fix the AMD issue caused by inappropriate of symbol exposure (vllm-project#21647)"

10aaf0b

This reverts commit 9ba1c88. Signed-off-by: Gregory Shtrasberg <[email protected]>

cleanup

3a64780

Merge remote-tracking branch 'origin/revert_wrong_fix' into upstream_merge_2025_07_29

4fe15a8

Merge pull request ROCm#613 from ROCm/upstream_merge_2025_07_29

b6ddf62

Upstream merge 2025 07 29

Update the base dockerfile to match the one actually built (ROCm#623)

dfe3216

Updated README.md for August 12 RC2 throughput results only

8bd11e2

Waiting on latency results

gshtras reviewed Aug 13, 2025

Mcirino1 added 4 commits August 13, 2025 11:15

Remove CYTHON=1

e71c051

Update README.md

040de12

Added partial latency results

c936f5e

Added missing latency results

7802e2b

Mcirino1 marked this pull request as ready for review August 14, 2025 15:17

Mcirino1 requested review from shajrawi, maleksan85, sunway513 and hongxiayang as code owners August 14, 2025 15:17

shajrawi previously approved these changes Aug 14, 2025

gshtras force-pushed the main branch 2 times, most recently from 1d2c43d to eb9d4de Compare September 9, 2025 16:43
