add new test model for aclgraph single_request #3888
Conversation
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to the Contributing and Testing guides.
Code Review
This pull request adds a new test model, vllm-ascend/DeepSeek-V2-Lite-W8A8, to the single_request_aclgraph end-to-end test. My review identifies a critical issue with how server arguments are constructed for this new model, which would cause the test to fail. I've provided a code suggestion to fix the bug and refactor the code for better readability and maintainability.
The snippet under review, as added by this PR:

```python
if model == "vllm-ascend/DeepSeek-V2-Lite-W8A8":
    server_args = [
        "--no-enable-prefix-caching", "--tensor-parallel-size", "1",
        "--data-parallel-size", "quantization", "ascend",
        str(dp_size), "--port",
        str(port), "--trust-remote-code", "--gpu-memory-utilization", "0.9"
    ]
else:
    server_args = [
        "--no-enable-prefix-caching", "--tensor-parallel-size", "1",
        "--data-parallel-size",
        str(dp_size), "--port",
        str(port), "--trust-remote-code", "--gpu-memory-utilization", "0.9"
    ]
```
The server arguments for the new model vllm-ascend/DeepSeek-V2-Lite-W8A8 are incorrect. The arguments "quantization" and "ascend" are misplaced and will cause the argument parser to fail. They should be passed as "--quantization", "ascend".
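To illustrate the failure mode, here is a minimal argparse sketch; this is not the project's actual CLI parser, and the flag subset and values are assumptions for demonstration only:

```python
import argparse

# Minimal stand-in for the server's CLI parser (assumption: the real vLLM
# parser behaves like argparse here, with --data-parallel-size typed as int).
parser = argparse.ArgumentParser()
parser.add_argument("--data-parallel-size", type=int)
parser.add_argument("--quantization")

# Buggy ordering from the PR: "--data-parallel-size" consumes "quantization"
# as its value (not a valid int), and "ascend" is left as an unrecognized
# positional argument, so parsing aborts.
# parser.parse_args(["--data-parallel-size", "quantization", "ascend"])  # SystemExit

# Corrected ordering: every flag is immediately followed by its own value.
args = parser.parse_args(["--data-parallel-size", "2", "--quantization", "ascend"])
print(args)  # Namespace(data_parallel_size=2, quantization='ascend')
```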
Additionally, there is significant code duplication between the if and else blocks. This can be refactored to improve readability and maintainability by defining the common arguments first and then conditionally adding the model-specific ones.
Suggested change:

```python
server_args = [
    "--no-enable-prefix-caching", "--tensor-parallel-size", "1",
    "--data-parallel-size", str(dp_size),
    "--port", str(port),
    "--trust-remote-code", "--gpu-memory-utilization", "0.9"
]
if model == "vllm-ascend/DeepSeek-V2-Lite-W8A8":
    server_args.extend(["--quantization", "ascend"])
```
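For context, a hedged sketch of how the resulting argument list would typically be consumed, assuming the e2e test launches the OpenAI-compatible server via `vllm serve`; the actual harness in vllm-ascend may wrap this in a helper class, and the `dp_size`/`port` values below are placeholders:

```python
import subprocess

# Placeholder values for illustration only; the real test supplies these.
model = "vllm-ascend/DeepSeek-V2-Lite-W8A8"
dp_size, port = 1, 8000

server_args = [
    "--no-enable-prefix-caching", "--tensor-parallel-size", "1",
    "--data-parallel-size", str(dp_size),
    "--port", str(port),
    "--trust-remote-code", "--gpu-memory-utilization", "0.9",
]
if model == "vllm-ascend/DeepSeek-V2-Lite-W8A8":
    server_args.extend(["--quantization", "ascend"])

# Launch the server; a real test would poll the server's health endpoint
# before sending single-request aclgraph traffic, then terminate the process.
proc = subprocess.Popen(["vllm", "serve", model, *server_args])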
What this PR does / why we need it?
Adds a new test model, vllm-ascend/DeepSeek-V2-Lite-W8A8, to the aclgraph single_request end-to-end test.
Does this PR introduce any user-facing change?
No.
How was this patch tested?
UT (the updated single_request_aclgraph e2e test).