Mirroring changes in test-pipeline.yaml into test-amd.yaml #27242
Conversation
Signed-off-by: Alexei V. Ivanov <[email protected]>
Code Review

This pull request mirrors changes from test-pipeline.yaml into test-amd.yaml. While many changes are valid, several NVIDIA-specific test configurations (for Blackwell and H200 GPUs) have been incorrectly copied into the AMD pipeline file. These steps will fail on AMD hardware and should be removed. There is also a CUDA-specific test dependency that should be removed.
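NVIDIA-only steps like these can also be found mechanically. The sketch below is a hypothetical helper, not part of vLLM's CI tooling: it scans a Buildkite-style pipeline text for step labels whose bodies mention NVIDIA-specific markers such as `gpu: b200`, `gpu: h200`, or `nvidia-smi`.

```python
# Hypothetical helper for flagging NVIDIA-only steps in an AMD pipeline.
# Markers chosen from the steps called out in this review; adjust as needed.
NVIDIA_MARKERS = ("gpu: b200", "gpu: h200", "nvidia-smi")

def find_nvidia_only_steps(pipeline_text: str) -> list[str]:
    flagged, current = [], None
    for line in pipeline_text.splitlines():
        stripped = line.strip()
        if stripped.startswith("- label:"):
            # Track the current step label, dropping trailing "# 21 min" comments.
            current = stripped.split(":", 1)[1].split("#")[0].strip()
        elif current and any(m in stripped for m in NVIDIA_MARKERS):
            if current not in flagged:
                flagged.append(current)
    return flagged
```

Run against test-amd.yaml, this would list exactly the Blackwell and H200 steps flagged in this review.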
- label: Blackwell Test # 21 min
  timeout_in_minutes: 30
- label: Blackwell Fusion Tests # 30 min
  timeout_in_minutes: 40
  working_dir: "/vllm-workspace/"
  gpu: b200
  source_file_dependencies:
    - csrc/quantization/fp4/
    - vllm/model_executor/layers/quantization/utils/flashinfer_utils.py
    - vllm/v1/attention/backends/flashinfer.py
    - vllm/compilation/
    # can affect pattern matching
    - vllm/model_executor/layers/layernorm.py
    - vllm/model_executor/layers/activation.py
    - vllm/model_executor/layers/quantization/input_quant_fp8.py
  commands:
    - nvidia-smi
    - pytest -v -s tests/compile/test_fusion_attn.py
    - pytest -v -s tests/compile/test_silu_mul_quant_fusion.py
    # this runner has 2 GPUs available even though num_gpus=2 is not set
    - pytest -v -s tests/compile/test_fusion_all_reduce.py
    - pytest -v -s tests/compile/test_fusions_e2e.py
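For context, source_file_dependencies gates whether a step runs at all: the step is triggered only when a changed file falls under one of the listed paths. A minimal sketch of that prefix-matching rule follows; the exact logic in vLLM's pipeline generator may differ.

```python
def step_should_run(changed_files: list[str], deps: list[str]) -> bool:
    # A step is triggered when any changed file falls under one of its
    # declared dependency paths (simple prefix match; vLLM's actual
    # pipeline generator may apply additional rules).
    return any(f.startswith(dep) for f in changed_files for dep in deps)
```

This is why copying an NVIDIA-specific dependency into the AMD file matters: any edit under that path would trigger the step on AMD runners.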
- label: Distributed Tests (H200) # optional
  gpu: h200
  optional: true
  working_dir: "/vllm-workspace/"
  num_gpus: 2
  commands:
    - pytest -v -s tests/compile/test_async_tp.py
    - pytest -v -s tests/compile/test_sequence_parallelism.py
    - pytest -v -s tests/compile/test_fusion_all_reduce.py
    - pytest -v -s tests/compile/test_fusions_e2e.py::test_tp2_attn_quant_allreduce_rmsnorm
    - pytest -v -s tests/distributed/test_context_parallel.py
    - CUDA_VISIBLE_DEVICES=1,2 VLLM_ALL2ALL_BACKEND=deepep_high_throughput VLLM_USE_DEEP_GEMM=1 VLLM_LOGGING_LEVEL=DEBUG python3 examples/offline_inference/data_parallel.py --model Qwen/Qwen1.5-MoE-A2.7B --tp-size=1 --dp-size=2 --max-model-len 2048
source_file_dependencies:
  - csrc/
  - tests/kernels/core
  - tests/kernels/test_top_k_per_row.py
The test file tests/kernels/test_top_k_per_row.py is CUDA-specific, as indicated by @pytest.mark.skipif(not current_platform.is_cuda(), reason="This test requires CUDA") within the file. Including it as a source dependency in test-amd.yaml is incorrect; this dependency should be removed from the AMD pipeline.
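That guard follows the standard pytest skipif pattern. A minimal self-contained sketch is below; is_cuda() here is a stand-in for vLLM's current_platform.is_cuda(), and the test body is hypothetical.

```python
import pytest

def is_cuda() -> bool:
    # Stand-in for vLLM's current_platform.is_cuda(): probe for a
    # usable CUDA runtime via torch, if torch is installed at all.
    try:
        import torch
        return torch.cuda.is_available()
    except ImportError:
        return False

@pytest.mark.skipif(not is_cuda(), reason="This test requires CUDA")
def test_top_k_per_row():
    # CUDA-only kernel assertions would go here.
    ...
```

Note the skip happens at collection time on the AMD runner, so the step would still be scheduled (and consume a runner slot) whenever the dependency path triggers it.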
- cd .. && VLLM_WORKER_MULTIPROC_METHOD=spawn pytest -v -s tests/models/multimodal/generation/test_whisper.py -m core_model # Otherwise, mp_method="spawn" doesn't work
- label: Multi-Modal Accuracy Eval (Small Models) # 50min
  mirror_hardwares: [amdexperimental]
Why don't we use AMD production hardware to enable this in the actual CI?
This pipeline, in its current state, is meant for monitoring only and does not gate PRs.
- label: Blackwell Test # 38 min
  timeout_in_minutes: 60
- label: Blackwell Test # 21 min
Do we still need to test Blackwell on this AMD specific pipeline?
We include that test group for completeness. For the moment it is just a comment in test-amd.yaml, not an actionable instruction, but things may change in the future.
Assuming steps marked gpu: h200 are skipped in this pipeline and do not put load on the actual H200 nodes.