vllm-project / llm-compressor Public

Notifications You must be signed in to change notification settings
Fork 286
Star 2.2k

Code
Issues 64
Pull requests 43
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: vllm-project/llm-compressor

Labels 21 Milestones 0

New pull request New

43 Open 910 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Support wInt4aFp8 for moe

#2027 opened Nov 12, 2025 by Wangzheee

Loading…

[tests] switch lm_eval invocation to use pre-loaded transformers model ready

When a PR is ready for review

#2018 opened Nov 10, 2025 by brian-dellabetta

Loading…

[Sequential Onloading] Support onloading and offloading frozen dataclasses

#2016 opened Nov 10, 2025 by kylesayrs

Loading…

[Bugfix] IntermediatesCache nested model inputs ready

When a PR is ready for review

#2015 opened Nov 10, 2025 by kylesayrs

Loading…

[TypeHint] Fix format_calibration_data type hint

#2012 opened Nov 10, 2025 by kylesayrs

Loading…

Fix: Set TOKENIZERS_PARALLELISM to false

#2009 opened Nov 10, 2025 by Nsumithreddy • Draft

Implement propagate_error argument ready

When a PR is ready for review

#2008 opened Nov 10, 2025 by kylesayrs

Loading…

Granite4 FP8 Block Quantization

#2001 opened Nov 6, 2025 by krishnateja95

Loading…

Add Intel AutoRound algorithm support ready

When a PR is ready for review

#1994 opened Nov 5, 2025 by yiliu30

Loading…

3 tasks

[model_free_ptq] NVFP4A16 ready

When a PR is ready for review

#1988 opened Nov 3, 2025 by kylesayrs

Loading…

[Kimi Linear] FP8 Example

#1986 opened Oct 31, 2025 by dsikka • Draft

[AWQ] Allow users to disable quantization during AWQ

#1973 opened Oct 28, 2025 by brian-dellabetta • Draft

[WIP] Generalize AWQ quantization

#1961 opened Oct 22, 2025 by kylesayrs • Draft

Adding new MoE e2e tests fp8

For any issue / PR related to FP8 support

nvfp4

For any PR / issue related to NVFP4 support

ready

When a PR is ready for review

wNa16

Anything related to weight-only int-quantized support

#1960 opened Oct 22, 2025 by HDCharles

Loading…

[Oneshot] Add validation for empty dataset and enhance oneshot function parameters

#1957 opened Oct 21, 2025 by ArkaSanka

Loading…

[Autowrapper] Trace vision tower for better offloading

#1948 opened Oct 18, 2025 by kylesayrs • Draft

[MXFP4] Support

#1938 opened Oct 15, 2025 by dsikka • Draft

[Observers] Change MSE global scale objective function

#1935 opened Oct 14, 2025 by kylesayrs • Draft

AI Fix for: Create AWQ guide for llm-docs

#1932 opened Oct 14, 2025 by shanaya-Gupta

Loading…

[Attention] Support FP4 attention quantization nvfp4

For any PR / issue related to NVFP4 support

#1924 opened Oct 14, 2025 by kylesayrs

Loading…

Add: File Based Caching for lm_eval tests

#1900 opened Oct 6, 2025 by rahul-tuli • Draft

[Training] Fix tokenizer attribute of SessionMixin ready

When a PR is ready for review

#1895 opened Oct 1, 2025 by kylesayrs

Loading…

add gpt oss nvfp4 example

#1885 opened Sep 30, 2025 by shanjiaz • Draft

Add awq activation fp8 support in loss compute

#1873 opened Sep 27, 2025 by Bluedyson

Loading…

[Logging] clean up CompressionLogger verbosity ready

When a PR is ready for review

#1861 opened Sep 23, 2025 by brian-dellabetta

Loading…

Previous 1 2 Next

Previous Next

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!