
Support multimodal in logit checker + match gemma3 logits with HF #2203


Open
wants to merge 1 commit into base: main

Conversation


@aireenmei (Collaborator) commented Aug 19, 2025

Description

I found some mismatches in the gemma3 logits compared with HF (b/437988753).
Key changes to close the gap:

  • Match the vocab size: HF uses a larger vocab size that includes the special image tokens.
  • Match the special image token ID with HF.
  • Use linear RoPE scaling with a factor of 8 for the global attention layers in the language model (see the sketch after this list).
  • Allow setting the gemma3 vision encoder precision through config.matmul_precision (this will be a separate PR due to potential conflicts with the ongoing NNX migration).
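For reference, here is a minimal NumPy sketch of what linear RoPE scaling with a factor of 8 means in practice. The function name, the max_timescale default, and the table layout are illustrative rather than MaxText's actual RoPE code; the point is only that positions are divided by the scaling factor before the rotary tables are built.

import numpy as np

def rope_tables(positions, head_dim, scaling_factor=8.0, max_timescale=10_000):
    # Linear RoPE scaling: divide positions by the factor before building the
    # sin/cos tables. Per this PR it applies only to the global attention
    # layers of the gemma3 language model.
    scaled = np.asarray(positions, dtype=np.float32) / scaling_factor
    # Standard RoPE inverse frequencies, one per pair of head dimensions.
    inv_freq = 1.0 / (max_timescale ** (np.arange(0, head_dim, 2) / head_dim))
    angles = scaled[:, None] * inv_freq[None, :]  # shape: [seq_len, head_dim // 2]
    return np.sin(angles), np.cos(angles)

sin_table, cos_table = rope_tables(np.arange(16), head_dim=256)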

Tests

  • Add a unit test in check_gemma3_layers.py to check the RoPE implementation.
  • Logits test (a sketch of the comparison is shown after this list):
    Use generate_hf_golden_logits.py to generate HF golden logits on an M1 CPU, dtype=float32
    Run forward_pass_logit_checker on a v5p TPU, with all dtypes and precision set to float32
    Results: see b/437988753
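Roughly, the logits check boils down to the comparison below. The file names and tolerance are illustrative only; the real flow goes through generate_hf_golden_logits.py and forward_pass_logit_checker rather than standalone .npy files.

import numpy as np

# Hypothetical file names, for illustration only.
hf_logits = np.load("hf_golden_logits.npy")      # HF forward pass, M1 CPU, float32
maxtext_logits = np.load("maxtext_logits.npy")   # MaxText forward pass, v5p TPU, float32

max_abs_diff = np.max(np.abs(hf_logits - maxtext_logits))
print(f"max absolute logit diff: {max_abs_diff:.6f}")
assert np.allclose(hf_logits, maxtext_logits, atol=1e-2), "gemma3 logits do not match HF"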

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed.

@aireenmei aireenmei changed the title Support multimodal in logit checker + match gemma3 logit to HF Support multimodal in logit checker + match gemma3 logits wit HF Aug 20, 2025
@aireenmei aireenmei changed the title Support multimodal in logit checker + match gemma3 logits wit HF Support multimodal in logit checker + match gemma3 logits with HF Aug 20, 2025
@aireenmei aireenmei marked this pull request as ready for review August 20, 2025 06:30
Collaborator commented:

Thank you Aireen for digging deep and finding this precision issue!

Is there any chance you could split out the "precision" changes in gemma3.py into a separate PR? Asking because we are doing the NNX migration, and all of these layers will be rewritten from nn to nnx (example). I guess we can plug in these "precision" changes after the migration? Either way works for me.

@aireenmei force-pushed the aireen/logits branch 2 times, most recently from 0a94059 to 5ea4e1a on August 20, 2025 20:57
@@ -21,7 +21,7 @@ base_num_kv_heads: 8
 base_mlp_dim: 15360
 head_dim: 256
 mlp_activations: ["gelu","linear"]
-vocab_size: 262_144
+vocab_size: 262_208
Collaborator commented:

Why do we need this change here and in other model configs?

Doesn't this change impact checkpoint conversion?

The embedding lookup and unembed layers depend on the vocab size.
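To make that concrete, here is an illustrative sketch (shapes are made up and this is not the actual checkpoint conversion code) of why the conversion has to account for the new vocab size: the embedding table, and any tied unembedding, gains 64 rows for the special image tokens.

import numpy as np

old_vocab, new_vocab = 262_144, 262_208
embed_dim = 8  # tiny stand-in; the real model dimension is much larger
old_embedding = np.zeros((old_vocab, embed_dim), dtype=np.float32)  # stand-in for converted weights

resized = np.zeros((new_vocab, embed_dim), dtype=np.float32)
resized[:old_vocab] = old_embedding  # the 64 new image-token rows stay zero-initialized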
