Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion requirements/rocm-build.txt
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
# Common dependencies
-r common.txt

--extra-index-url https://download.pytorch.org/whl/rocm6.3
--extra-index-url https://download.pytorch.org/whl/rocm6.4
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

While this change correctly points to the PyTorch wheel for ROCm 6.4, it introduces a critical inconsistency with the project's build environment. The base Docker image, defined in docker/Dockerfile.rocm_base, is still configured to use ROCm 6.3.1 (rocm/dev-ubuntu-22.04:6.3.1-complete).

This mismatch between the ROCm version expected by the PyTorch wheel and the version provided by the Docker image will likely lead to build failures or runtime errors. To resolve this, you must also update the BASE_IMAGE argument in docker/Dockerfile.rocm_base to a corresponding ROCm 6.4 image.

For example:

-ARG BASE_IMAGE=rocm/dev-ubuntu-22.04:6.3.1-complete
+ARG BASE_IMAGE=rocm/dev-ubuntu-22.04:6.4-complete

(Please verify the correct image tag for ROCm 6.4).

Additionally, you should check if other related dependencies in docker/Dockerfile.rocm_base, such as HIPBLASLT_BRANCH and RCCL_BRANCH, need to be updated for compatibility with ROCm 6.4.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

^see this, i think it's a good catch

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Dockerfile.rocm_base and rocm-build.txt are used in different circumstances.
rocm-build.txt is for bare metal build, without the docker.
When torch is installed from whl, as in the case with rocm-build.txt, blas libraries, and pretty much the entire rocm stack are brought together with this whl, that's why it's so bit, and also why in the normal workflow we don't use it.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Having said that, the ROCm dockerfiles here were out of sync, so I created a parallel PR: #24279

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@gshtras Out of curiosity, what is the reason we want to maintain a ROCm version of vLLM? Are there any cool features in ROCm that have not yet been brought to vLLM?

And thanks for the quick review and fix!

Copy link
Collaborator

@gshtras gshtras Sep 4, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ROCm fork is mainly deprecated, when we build our nightly dockers, it's from this upstream repo.
The only thing it's used for is the two dockerfiles, mainly due to our CI constrains (and possibly at least partially because we forgot about it)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the information : )

torch==2.8.0
torchvision==0.23.0
torchaudio==2.8.0
Expand Down