Pull requests: vllm-project/vllm-gaudi
Pull requests list
[Feature][SpecDecode][Part2] Eagle3,MTP enabling, accept_rate improvement
#142 opened Sep 6, 2025 by xuechendi
Do not add max_blocks as a bucket in linear bucketing with cpa
#121 opened Sep 2, 2025 by mswiniarsk
Enable LMCache for cpuoffloading, LMCache docker support, enable lmcache
#64 opened Aug 6, 2025 by hsubramony (Draft)
Add graph compilation tracking to high level profiler
#50 opened Jul 28, 2025 by kzawora-intel