Pull requests: AI-Hypercomputer/maxtext
Pull requests list
Add an option to save a checkpoint on completion
#2222
opened Aug 21, 2025 by
xuefgu
4 tasks done
Provide MaxText axes to cudnn_flash_te to correctly perform dbias reduction if required
#2221
opened Aug 21, 2025 by
jberchtold-nvidia
4 tasks done
Update RESTRUCTURE.md to include latest changes
#2220
opened Aug 21, 2025 by
bvandermoon
4 tasks done
Add stochastic decoding sampling strategy, allowing run-time sampling parameters change.
#2215
opened Aug 20, 2025 by
babyplutokurt
4 tasks done
[Qwix] Correctly plumb quantization_rule to kernel
#2206
opened Aug 19, 2025 by
khatwanimohit
4 tasks done
Fix [late-directive] error reported by pytype static analyzer in max_utils.py
#2205
opened Aug 19, 2025 by
copybara-service bot
Disable depth scaling in query projection if qk_norm or query_pre_attn_scalar are set
#2204
opened Aug 19, 2025 by
gagika
4 tasks done
Support multimodal in logit checker + match gemma3 logits with HF
#2203
opened Aug 19, 2025 by
aireenmei
4 tasks done
Migrate DotProductAttention to NNX
#2198
opened Aug 18, 2025 by
hsuan-lun-chiang
4 tasks done
Refactor MaxText's checkpointing.py to support sharded loading of SafeTensor's checkpoints.
#2192
opened Aug 15, 2025 by
copybara-service bot
Add remote_bytes_transferred to Pallas CostEstimate dataclass.
#2191
opened Aug 15, 2025 by
copybara-service bot
XLA flag xla_gpu_graph_level has been depreciated, and should use
#2186
opened Aug 15, 2025 by
copybara-service bot
XLA flag xla_gpu_graph_level has been depreciated, and should use
#2184
opened Aug 15, 2025 by
copybara-service bot
Use pl.ANY/pl.MemorySpace.ANY instead of the pltpu variant in preparation for its deprecation.
#2180
opened Aug 15, 2025 by
copybara-service bot
Enhance Checkpoint Converter for MoE Models
#2177
opened Aug 14, 2025 by
parambole
4 tasks done
Add valid test cases list for sharding tests
#2175
opened Aug 14, 2025 by
hsuan-lun-chiang
4 tasks done