[TEST]Add 2P1D multi node cases for nightly test #3764

jiangyunfan1 · 2025-10-25T09:06:45Z

What this PR does / why we need it?

This PR adds the 2P1D multi node func/acc/perf test cases, we need test them daily

Does this PR introduce any user-facing change?

No

How was this patch tested?

by running the test

vLLM version: v0.11.0rc3
vLLM main: vllm-project/vllm@c9461e0

Signed-off-by: jiangyunfan1 <[email protected]>

github-actions · 2025-10-25T09:07:04Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

gemini-code-assist

Code Review

This pull request enables and updates a multi-node end-to-end test for the DeepSeek-R1-W8A8 model. The changes include updating benchmark configurations in the YAML file, implementing the test logic in test_multi_node.py to run performance and accuracy tests using aisbench, and a small fix in aisbench.py to handle single test cases. My review focuses on the implementation in test_multi_node.py. I've identified a potential issue with calling blocking code from an asynchronous function, which could impact performance and is against asyncio best practices. I've provided a suggestion to fix this.

gemini-code-assist · 2025-10-25T09:07:45Z

tests/e2e/nightly/multi_node/test_multi_node.py

+                run_aisbench_cases(config.model, port, acc_cmd)
+                run_aisbench_cases(config.model, port, perf_cmd)


The run_aisbench_cases function is synchronous and performs blocking I/O operations (e.g., subprocess.Popen and reading from stdout). Calling it directly from an async function like test_multi_node will block the asyncio event loop, which can lead to performance issues and defeats the purpose of using asyncio.

You should run blocking functions in a separate thread to avoid blocking the event loop. You can use asyncio.to_thread for this.

Note: You'll need to add import asyncio at the top of the file.

Suggested change

run_aisbench_cases(config.model, port, acc_cmd)

run_aisbench_cases(config.model, port, perf_cmd)

await asyncio.to_thread(run_aisbench_cases, config.model, port, acc_cmd)

await asyncio.to_thread(run_aisbench_cases, config.model, port, perf_cmd)

Signed-off-by: jiangyunfan1 <[email protected]>

Signed-off-by: wangli <[email protected]>

jiangyunfan1 added 4 commits October 24, 2025 07:41

add prefix cache qwen3-32b-int8

28a9e32

Signed-off-by: jiangyunfan1 <[email protected]>

Merge remote-tracking branch 'upstream/main' into new_branch4

a638131

Merge remote-tracking branch 'upstream/main' into new_branch4

153c621

add 2P1D tests

c61de8d

Signed-off-by: jiangyunfan1 <[email protected]>

github-actions bot added module:tests module:tools labels Oct 25, 2025

gemini-code-assist bot reviewed Oct 25, 2025

View reviewed changes

jiangyunfan1 changed the title ~~New branch4~~ [TEST]Add 2P1D multi node cases for nightly test Oct 25, 2025

jiangyunfan1 and others added 10 commits October 25, 2025 09:25

consider empty tests

09081cd

Signed-off-by: jiangyunfan1 <[email protected]>

rm some aisbench conf

1fa55f6

Signed-off-by: jiangyunfan1 <[email protected]>

add install ais bench

340d0ab

Signed-off-by: wangli <[email protected]>

fix assert error

713e661

Signed-off-by: wangli <[email protected]>

fix

623f55d

Signed-off-by: wangli <[email protected]>

fix

6be2099

Signed-off-by: wangli <[email protected]>

fix host

0cbd957

Signed-off-by: wangli <[email protected]>

fix

27868ba

Signed-off-by: wangli <[email protected]>

fix

18c61bc

Signed-off-by: wangli <[email protected]>

fix lint

b107b9e

Signed-off-by: wangli <[email protected]>

Potabk mentioned this pull request Oct 27, 2025

[CI] Add multi-node test case for a2 #3805

Merged

wangxiyuan approved these changes Oct 27, 2025

View reviewed changes

wangxiyuan merged commit 9030106 into vllm-project:main Oct 27, 2025
20 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[TEST]Add 2P1D multi node cases for nightly test #3764

[TEST]Add 2P1D multi node cases for nightly test #3764

jiangyunfan1 commented Oct 25, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Oct 25, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Oct 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		run_aisbench_cases(config.model, port, acc_cmd)
		run_aisbench_cases(config.model, port, perf_cmd)

Uh oh!

[TEST]Add 2P1D multi node cases for nightly test #3764

[TEST]Add 2P1D multi node cases for nightly test #3764

Conversation

jiangyunfan1 commented Oct 25, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

github-actions bot commented Oct 25, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jiangyunfan1 commented Oct 25, 2025 •

edited by github-actions bot

Loading