[Test] Test ge graph use DeepSeek-V2-Lite model #3842
Conversation
Signed-off-by: CodeNine-CJ <[email protected]>
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.
Code Review
This pull request adds end-to-end tests for the DeepSeek-V3-Lite model in GE Graph mode. The implementation introduces a new test fixture that is largely a duplication of an existing one, which impacts maintainability. More critically, the new test appears to use incorrect golden data copied from another test, which undermines the validity of the test itself. I've provided two comments: one critical issue regarding the test data and a high-severity issue about code duplication.
```python
    # NOTE: vllm-ascend/DeepSeek-V3-Pruning is a random weight of
    # DeepSeek-V3 with 2 hidden layers, thus the golden results seems
    # inaccurate. This will only change if accuracy improves with the
    # official weights of DeepSeek-V3.
    golden_results = [
        'Hello, my name is下载早点向前很有่อง',
        'The president of the United States isSender)## physiological Albany',
        'The capital of France is Rocky转角 hospitalizedinterval sparked',
        'The future of AI is её asegο BIOS一扫',
    ]
```
The `golden_results` and the accompanying comment appear to be copied from the test for `vllm-ascend/DeepSeek-V3-Pruning` without being updated for the `deepseek-ai/DeepSeek-V3-Lite` model.

- Incorrect comment: the comment at lines 263-266 refers to `vllm-ascend/DeepSeek-V3-Pruning`, which is not the model under test here. This is misleading.
- Suspicious golden results: the `golden_results` are identical to those for `vllm-ascend/DeepSeek-V3-Pruning`. It is extremely unlikely that two different models produce the exact same output, which strongly suggests the golden values are incorrect for `deepseek-ai/DeepSeek-V3-Lite`.

A test with incorrect golden data is critically flawed: it doesn't validate the model's behavior and can mask regressions. Please regenerate the `golden_results` from the actual output of `deepseek-ai/DeepSeek-V3-Lite` and update the comment to be relevant to this model, or remove it if the output is accurate.
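For illustration, the comparison that such golden-data tests perform can be sketched independently of vLLM. The helper `check_golden` and the stub outputs below are hypothetical names for this sketch; in the real test, `vllm_output` comes from `VllmRunner.generate_greedy`, which yields `(token_ids, text)` pairs:

```python
# Sketch: validating generated texts against model-specific golden results.
# `check_golden` is a hypothetical helper, not part of the project's API.

def check_golden(vllm_output, golden_results):
    """Assert each generated text matches its golden counterpart."""
    assert len(golden_results) == len(vllm_output), (
        f"expected {len(golden_results)} outputs, got {len(vllm_output)}")
    for i, (_, text) in enumerate(vllm_output):
        assert golden_results[i] == text, (
            f"prompt {i}: expected {golden_results[i]!r}, got {text!r}")

# Stub outputs standing in for real model generations.
stub_output = [
    ([1, 2], "Hello, my name is Alice"),
    ([3, 4], "The capital of France is Paris"),
]
golden = ["Hello, my name is Alice", "The capital of France is Paris"]
check_golden(stub_output, golden)  # passes silently when all texts match
```

The point of keeping golden data per model is exactly what this sketch makes explicit: the assertion is only meaningful when `golden` was actually produced by the model under test.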
```python
def _deepseek_v3_lite_torchair_test_fixure(
    additional_config: Dict,
    *,
    tensor_parallel_size=2,
    use_v1_schduler=False,
):
    example_prompts = [
        "Hello, my name is",
        "The president of the United States is",
        "The capital of France is",
        "The future of AI is",
    ]

    kwargs = {}
    if not use_v1_schduler:
        kwargs = {
            "ascend_scheduler_config": {
                "enable": True,
            },
            "refresh": True,
        }
    additional_config.update(**kwargs)

    with VllmRunner(
            "deepseek-ai/DeepSeek-V3-Lite",
            dtype="half",
            tensor_parallel_size=tensor_parallel_size,
            distributed_executor_backend="mp",
            additional_config=additional_config,
    ) as vllm_model:
        vllm_output = vllm_model.generate_greedy(example_prompts, 5)

    # NOTE: vllm-ascend/DeepSeek-V3-Pruning is a random weight of
    # DeepSeek-V3 with 2 hidden layers, thus the golden results seems
    # inaccurate. This will only change if accuracy improves with the
    # official weights of DeepSeek-V3.
    golden_results = [
        'Hello, my name is下载早点向前很有่อง',
        'The president of the United States isSender)## physiological Albany',
        'The capital of France is Rocky转角 hospitalizedinterval sparked',
        'The future of AI is её asegο BIOS一扫',
    ]

    assert len(golden_results) == len(vllm_output)
    for i in range(len(vllm_output)):
        assert golden_results[i] == vllm_output[i][1]
        print(f"Generated text: {vllm_output[i][1]!r}")
```
This new test fixture `_deepseek_v3_lite_torchair_test_fixure` is almost an exact copy of the existing `_deepseek_torchair_test_fixture` function (lines 32-79). The main difference is the model name passed to `VllmRunner`.

To improve maintainability and avoid code duplication, this should be refactored. You could parameterize the existing `_deepseek_torchair_test_fixture` to accept the model name and golden results as arguments. This would eliminate the need for the new `_deepseek_v3_lite_torchair_test_fixure` function entirely, making the code cleaner and easier to maintain.
For example:

```python
def _deepseek_torchair_test_fixture(
    model_name: str,
    golden_results: list[str],
    additional_config: dict,
    # ... other params
):
    # ... existing logic ...
    with VllmRunner(
            model_name,  # Use the new parameter
            # ...
    ) as vllm_model:
        vllm_output = vllm_model.generate_greedy(example_prompts, 5)
    # ... assertion logic using golden_results ...

# Then update call sites:
def test_e2e_deepseekv3_with_torchair():
    # ...
    _deepseek_torchair_test_fixture(
        "vllm-ascend/DeepSeek-V3-Pruning",
        DEEPSEEK_V3_PRUNING_GOLDEN_RESULTS,
        additional_config,
    )

def test_e2e_deepseekv3lite_with_torchair():
    # ...
    _deepseek_torchair_test_fixture(
        "deepseek-ai/DeepSeek-V3-Lite",
        DEEPSEEK_V3_LITE_GOLDEN_RESULTS,
        additional_config,
    )
```

Addressing the code duplication will also make it easier to manage model-specific details like the `golden_results` and explanatory comments.
What this PR does / why we need it?
The existing tests lack coverage for the DeepSeek-V3-Lite model under GE Graph mode; this PR adds supplementary end-to-end tests for it.
Does this PR introduce any user-facing change?
No
How was this patch tested?