-
Notifications
You must be signed in to change notification settings - Fork 449
Description
Release Checklist
Release Version: v0.10.2rc1
Release Branch: main
Release Date: 2025/09/10
Release Manager: @wangxiyuan
Prepare Release Note
-
Create a new issue for release feedback [v0.10.2rc1] FAQ / Feedback | 问题/反馈 #2874
-
Write the release note PR. [Release] Add release note for v0.10.2rc1 #2921
-
Update the feedback issue link in docs/source/faqs.md
-
Add release note to docs/source/user_guide/release_notes.md
-
Update release version in README.md and README.zh.md
-
Update version info in docs/source/community/versioning_policy.md
-
Update contributor info in docs/source/community/contributors.md
-
Update package version in docs/conf.py
-
PR need Merge
-
[Bug]: accuract test failed due to unexpected keyword argument 'prompt_token_ids' #2865
-
Quantization accuracy problem [Quantization] register AscendQuantRMSNorm for quantization #2856 @22dimensions
-
aclgraph quantization support @yiz-liu [Feat]support dynamic quantization in allgather #2841
-
Fix the bugs about operator registration by PyTorch Dispatcher #2786
-
[Bug]: accuracy test failed due to
Forward context is not set
#2876 accuracy test fix @wxsIcey -
[Core] Disable the chunked prefill feature in Non-MLA LLMs #2894
-
[Bug]: Qwen 235B Multi-Node + 1P1D + 2DP + 16TP + EP failed #2900
-
GLM accuracy problem @shen-shanshan
-
Qwen3-next support @MengqingCao @wangxiyuan
Will not be merged:
Functional Test
- Qwen3 235B aclgraph test @zhangxinyuehfad
- Qwen 235B PD DP EP TP @zhangxinyuehfad
- deepseek v3 + torchair + all2allv @Potabk
- Async scheduler feature test @Potabk
- Qwen2.5 VL W8A8 @22dimensions
- Qwen3 8B + flash comm @Potabk
- aclgrah w8a8 @Potabk
Doc Test
- Tutorial is updated.
- User Guide is updated.
- Developer Guide is updated.
Prepare Artifacts
- Docker image is ready.
- Wheel package is ready.
Release Step
- Release note PR is merged.
- Post the release on GitHub release page.
- Generate official doc page on https://app.readthedocs.org/dashboard/
- Wait for the wheel package to be available on https://pypi.org/project/vllm-ascend
- Wait for the docker image to be available on https://quay.io/ascend/vllm-ascend
- Upload 310p wheel to Github release page
- Broadcast the release news (By message, blog , etc)
- Close this issue