static quantization files for llama4-maverick #1968

sureshnam · 2025-09-22T22:11:58Z

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results

Purpose

Static quantization files for models to share to customer

Test Plan

Test Result

varistar · 2025-09-23T06:59:36Z

@sureshnam , are these quant scales identical to https://huggingface.co/sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8/tree/main/quant/g3?

PatrykWo

Please move the 1.22.0 lower in structure, as subfolder of .static_quant.
EDIT: It's UI issue. Ignore my comment.

PatrykWo · 2025-09-23T14:08:37Z

@sureshnam please resolve pre-commit error.

sureshnam · 2025-09-23T18:47:36Z

@varistar, the files are the same uploaded to HF as well.

PatrykWo

LGTM

PatrykWo · 2025-09-24T07:04:45Z

/skip-gaudi-tests

varistar · 2025-09-29T11:57:29Z

We have another version of scales which give +20% perf speed-up enabling expert-parallelism, please hold the merge till we complete comparison and run accuracy testing

static quantization files for llama4-maverick

b429022

sureshnam requested review from PatrykWo, afierka-intel, deepvars, jikunshang, kzawora-intel, madamczyk-intel, mgawarkiewicz-intel, michalkuligowski, mswiniarsk, vivekgoe, wpyszka and xuechendi as code owners September 22, 2025 22:11

sureshnam added 2 commits September 23, 2025 09:09

update readme

d99c7af

update readme2

8d951ad

PatrykWo requested changes Sep 23, 2025

View reviewed changes

updated the README

2ce3bce

updated the README

b8a6a4f

PatrykWo approved these changes Sep 24, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

static quantization files for llama4-maverick #1968

static quantization files for llama4-maverick #1968

Uh oh!

sureshnam commented Sep 22, 2025 •

edited by github-actions bot

Loading

Uh oh!

varistar commented Sep 23, 2025

Uh oh!

PatrykWo left a comment •

edited

Loading

Uh oh!

PatrykWo commented Sep 23, 2025 •

edited

Loading

Uh oh!

sureshnam commented Sep 23, 2025

Uh oh!

PatrykWo left a comment

Uh oh!

PatrykWo commented Sep 24, 2025

Uh oh!

varistar commented Sep 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

static quantization files for llama4-maverick #1968

Are you sure you want to change the base?

static quantization files for llama4-maverick #1968

Uh oh!

Conversation

sureshnam commented Sep 22, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Essential Elements of an Effective PR Description Checklist

Purpose

Test Plan

Test Result

Uh oh!

varistar commented Sep 23, 2025

Uh oh!

PatrykWo left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

PatrykWo commented Sep 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sureshnam commented Sep 23, 2025

Uh oh!

PatrykWo left a comment

Choose a reason for hiding this comment

Uh oh!

PatrykWo commented Sep 24, 2025

Uh oh!

varistar commented Sep 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sureshnam commented Sep 22, 2025 •

edited by github-actions bot

Loading

PatrykWo left a comment •

edited

Loading

PatrykWo commented Sep 23, 2025 •

edited

Loading