-
Notifications
You must be signed in to change notification settings - Fork 482
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
allow expert_parallel wrapper to handel kwargs
CLA Signed
This label is managed by the Meta Open Source bot.
#1620
opened Aug 22, 2025 by
rakkit
Loading…
Centralize Async TP Enablement with maybe_enable_async_tp API
CLA Signed
This label is managed by the Meta Open Source bot.
#1619
opened Aug 21, 2025 by
fegin
Loading…
Move the call to init_attention_mask to trainer
CLA Signed
This label is managed by the Meta Open Source bot.
#1616
opened Aug 21, 2025 by
fegin
Loading…
VLM: Onboarding native resolution, native aspect ratio, interleaved VLM training
CLA Signed
This label is managed by the Meta Open Source bot.
#1615
opened Aug 21, 2025 by
lkhphuc
Loading…
Switch DeepSeekV3 to Use FlexAttention by Default
CLA Signed
This label is managed by the Meta Open Source bot.
#1610
opened Aug 21, 2025 by
fegin
Loading…
[Validation] fix setting all model_parts to eval mode
CLA Signed
This label is managed by the Meta Open Source bot.
release blocking
Issues that are blocking the milestone / release completion
#1607
opened Aug 20, 2025 by
wesleytruong
Loading…
Bump version to v0.1.1
CLA Signed
This label is managed by the Meta Open Source bot.
#1606
opened Aug 20, 2025 by
wwwjn
Loading…
workarounds for all2all autograd issues that Ruisi ran into
CLA Signed
This label is managed by the Meta Open Source bot.
#1604
opened Aug 20, 2025 by
bdhirsh
Loading…
Adding StateDictAdapter
CLA Signed
This label is managed by the Meta Open Source bot.
#1601
opened Aug 19, 2025 by
HosseinKaviani-H
Loading…
Wrap sync + a2a in a custom op
CLA Signed
This label is managed by the Meta Open Source bot.
release blocking
Issues that are blocking the milestone / release completion
#1597
opened Aug 19, 2025 by
soulitzer
Loading…
Update torchft.md
CLA Signed
This label is managed by the Meta Open Source bot.
#1596
opened Aug 19, 2025 by
H-Huang
Loading…
improve MoE bias update logic in optimizer
CLA Signed
This label is managed by the Meta Open Source bot.
release blocking
Issues that are blocking the milestone / release completion
#1593
opened Aug 19, 2025 by
rakkit
Loading…
[WIP] Activation Offloading with Separate Stream
CLA Signed
This label is managed by the Meta Open Source bot.
#1591
opened Aug 18, 2025 by
excelle08
Loading…
Update SAC config to force save instead of recompute
CLA Signed
This label is managed by the Meta Open Source bot.
[WIP][DSV3] Remove keep a copy of GroupedExperts weight, free memory in StateDictAdapter
CLA Signed
This label is managed by the Meta Open Source bot.
#1585
opened Aug 16, 2025 by
wwwjn
Loading…
Muon with 3D tensors
CLA Signed
This label is managed by the Meta Open Source bot.
#1584
opened Aug 16, 2025 by
byronxu99
Loading…
Add config to AC to toggle early-stop
CLA Signed
This label is managed by the Meta Open Source bot.
#1580
opened Aug 15, 2025 by
soulitzer
Loading…
add model_parts ref to MetricsProcessor
CLA Signed
This label is managed by the Meta Open Source bot.
#1578
opened Aug 15, 2025 by
garrett361
Loading…
[EP] add initial support for NVSHMEM-based all-to-all
CLA Signed
This label is managed by the Meta Open Source bot.
#1569
opened Aug 14, 2025 by
tianyu-l
Loading…
[Do Not Land] Debug for SDPA + CP nan issue in DeepSeekV3
CLA Signed
This label is managed by the Meta Open Source bot.
Multinode SkyPilot example
CLA Signed
This label is managed by the Meta Open Source bot.
#1564
opened Aug 13, 2025 by
alex000kim
Loading…
fix: remove redundant legacy usage of mp in checkpoint
CLA Signed
This label is managed by the Meta Open Source bot.
#1562
opened Aug 13, 2025 by
yzs981130
Loading…
[WIP] Experimental implementation of gpt-oss (grouped GEMM MoE + FlexAttention sink/sliding)
#1559
opened Aug 13, 2025 by
KhoomeiK
Loading…
[PoC] Enable flexible different layout for same mesh via a util function
CLA Signed
This label is managed by the Meta Open Source bot.
#1550
opened Aug 11, 2025 by
fduwjj
Loading…
[WIP] [mxfp8] torchao mxfp8 moe integration
CLA Signed
This label is managed by the Meta Open Source bot.
#1549
opened Aug 11, 2025 by
danielvegamyhre
•
Draft
Previous Next
ProTip!
Follow long discussions with comments:>50.