Conversation

@yiliu30 yiliu30 commented Sep 5, 2025

SUMMARY:
"please provide a brief summary"

TEST PLAN:
"please outline how the changes were tested"

yiliu30 and others added 5 commits September 4, 2025 08:24
Signed-off-by: yiliu30 <[email protected]>
SUMMARY:
With the latest transformers change, `test_kv_cache_gptq_model_state_dict_attr` is failing because it initializes empty weights on the meta device and then attempts to decompress while the weights are still on the meta device. I don't think this is the expected usage: by the time model decompression is called, the weights should already be fully loaded.


TEST PLAN:
Tested locally with the following command, which passed:
pytest tests/llmcompressor/transformers/kv_cache/test_kv_cache.py::test_kv_cache_gptq_model_state_dict_attr

---------

Signed-off-by: shanjiaz <[email protected]>
Signed-off-by: yiliu30 <[email protected]>
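
The meta-device issue described in the commit message can be sketched as follows. This is a minimal illustration, not the project's actual code: `has_meta_params` is a hypothetical helper showing the kind of check that distinguishes a meta-initialized model (unsafe to decompress) from one whose weights are fully loaded.

```python
import torch
from torch import nn

def has_meta_params(model: nn.Module) -> bool:
    # Hypothetical helper: True if any parameter still lives on the meta
    # device, i.e. real weights have not been materialized/loaded yet.
    return any(p.device.type == "meta" for p in model.parameters())

# A module built under a meta-device context (as in empty-weight
# initialization) has only placeholder parameters.
with torch.device("meta"):
    meta_model = nn.Linear(4, 4)
assert has_meta_params(meta_model)

# A normally constructed module has real weights, so decompression
# would be safe to run on it.
real_model = nn.Linear(4, 4)
assert not has_meta_params(real_model)
```

Under this sketch's assumptions, a decompression entry point could guard on such a check and raise early instead of attempting tensor math on meta placeholders.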