derekk-nm (Contributor)
SUMMARY:

  • Updates the Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 server configuration in an attempt to get the accuracy workflow to run.
  • Adds the missing base model and a missing server option for RedHatAI/DeepSeek-R1-0528-quantized.w4a16.
  • Updates HuggingFaceTB/SmolLM3-3B with an actual ground-truth gsm8k value.
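For illustration, the SmolLM3-3B change could look something like the sketch below. This is an assumption about the config layout, not the actual lm-eval-configs schema; the field names (`model`, `tasks`, `ground_truth`) and the metric value are hypothetical placeholders:

```yaml
# Hypothetical model-validation entry (field names and value are illustrative,
# not taken from the real neuralmagic/lm-eval-configs schema).
model: HuggingFaceTB/SmolLM3-3B
tasks:
  - name: gsm8k
    ground_truth: 0.00   # replaced a placeholder with a measured baseline value
```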

TEST PLAN:
Models were passed through the accuracy workflow.
The Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 model is simply too big to test without access to a larger system.

@tmuttaki commented Sep 4, 2025

I saw these models in @debroy-rh's small list of models for code coverage, but they are not present in the model validation config. Are you going to add these as well?

  • deepseek-ai/DeepSeek-V2-Lite
  • microsoft/Phi-3-medium-4k-instruct
  • Qwen/Qwen2-57B-A14B-Instruct
  • RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV
  • RedHatAI/Mixtral-8x7B-Instruct-v0.1-FP8
  • TechxGenus/Meta-Llama-3-8B-Instruct-GPTQ

@debroy-rh left a comment
Looks good, thanks Derek.

@derekk-nm (Contributor, Author)
Thanks for the observation, @tmuttaki.
Those models appear in the neuralmagic/lm-eval-configs/models directory because they were part of testing prior to Red Hat's acquisition of Neural Magic. They haven't been included in the Third Party Model Validation program initiated at the beginning of Red Hat Summit (that program is the source I'm using for the model list).

3 participants