-
Notifications
You must be signed in to change notification settings - Fork 286
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[tests] switch lm_eval invocation to use pre-loaded transformers model
ready
When a PR is ready for review
#2018
opened Nov 10, 2025 by
brian-dellabetta
Loading…
[Sequential Onloading] Support onloading and offloading frozen dataclasses
#2016
opened Nov 10, 2025 by
kylesayrs
Loading…
[Bugfix] IntermediatesCache nested model inputs
ready
When a PR is ready for review
#2015
opened Nov 10, 2025 by
kylesayrs
Loading…
Implement When a PR is ready for review
propagate_error argument
ready
#2008
opened Nov 10, 2025 by
kylesayrs
Loading…
Add Intel AutoRound algorithm support
ready
When a PR is ready for review
#1994
opened Nov 5, 2025 by
yiliu30
Loading…
3 tasks
[When a PR is ready for review
model_free_ptq] NVFP4A16
ready
#1988
opened Nov 3, 2025 by
kylesayrs
Loading…
[AWQ] Allow users to disable quantization during AWQ
#1973
opened Oct 28, 2025 by
brian-dellabetta
•
Draft
[Oneshot] Add validation for empty dataset and enhance oneshot function parameters
#1957
opened Oct 21, 2025 by
ArkaSanka
Loading…
[Attention] Support FP4 attention quantization
nvfp4
For any PR / issue related to NVFP4 support
#1924
opened Oct 14, 2025 by
kylesayrs
Loading…
[Training] Fix When a PR is ready for review
tokenizer attribute of SessionMixin
ready
#1895
opened Oct 1, 2025 by
kylesayrs
Loading…
[Logging] clean up CompressionLogger verbosity
ready
When a PR is ready for review
#1861
opened Sep 23, 2025 by
brian-dellabetta
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.