Add a workflow to run vLLM unit tests on H100 #55

huydhn · 2025-07-26T02:37:52Z

This is a temporary workflow to run some vLLM unit tests on H100 until we have that runner on vLLM. As this is a workflow on PyTorch side and our H100 capacity is limited, we could only run the test:

Manually on a vLLM PR:
- For metamates, go to bunnylol oss pytorch/pytorch-integration-testing, and add yourselves to the repo which will grant you write permission to run the workflow
- Go to https://github.com/pytorch/pytorch-integration-testing/actions/workflows/vllm-ci-test.yml to run the workflow
- Set the branch name to refs/pull/<YOUR_PR_NUMBER>/head
or periodically on vLLM main commit every 4 hours
- You can get the vLLM main commit from the Docker image name in the following format public.ecr.aws/q9t5s3a7/vllm-ci-postmerge-repo:<COMMIT>, for example, GH logs

More tests can be added into .github/scripts/run_vllm_tests.sh, I use pytest -v models/multimodal/generation/test_maverick.py as a simple example. All the features like test sharding or model caching are not available at the moment, but this is a start.

Testing

https://github.com/pytorch/pytorch-integration-testing/actions/runs/16535983273/job/46770512704?pr=55#step:8:214

Signed-off-by: Huy Do <[email protected]>

Add a workflow to run vLLM unit tests on H100

cffe0cf

Signed-off-by: Huy Do <[email protected]>

huydhn had a problem deploying to pytorch-x-vllm July 26, 2025 02:37 — with GitHub Actions Error

facebook-github-bot added the cla signed label Jul 26, 2025

Use the correct test path

e4c7f44

Signed-off-by: Huy Do <[email protected]>

huydhn had a problem deploying to pytorch-x-vllm July 26, 2025 03:17 — with GitHub Actions Failure

Find the right tests

49fbdaa

Signed-off-by: Huy Do <[email protected]>

huydhn temporarily deployed to pytorch-x-vllm July 26, 2025 04:02 — with GitHub Actions Inactive

Run on linux.aws.h100.4

f8c5b85

Signed-off-by: Huy Do <[email protected]>

huydhn temporarily deployed to pytorch-x-vllm July 26, 2025 04:16 — with GitHub Actions Inactive

huydhn marked this pull request as ready for review July 26, 2025 04:16

huydhn requested review from yangw-dev and houseroad July 26, 2025 04:22

huydhn added 2 commits July 25, 2025 21:52

[no ci] Just a comment update

0a99d6e

Signed-off-by: Huy Do <[email protected]>

Update the script path

0d72e9a

Signed-off-by: Huy Do <[email protected]>

huydhn temporarily deployed to pytorch-x-vllm July 26, 2025 04:53 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add a workflow to run vLLM unit tests on H100 #55

Add a workflow to run vLLM unit tests on H100 #55

Uh oh!

huydhn commented Jul 26, 2025 •

edited

Loading

Uh oh!

Uh oh!

Add a workflow to run vLLM unit tests on H100 #55

Are you sure you want to change the base?

Add a workflow to run vLLM unit tests on H100 #55

Uh oh!

Conversation

huydhn commented Jul 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Testing

Uh oh!

Uh oh!

huydhn commented Jul 26, 2025 •

edited

Loading