Skip to content

Pull requests: allenai/open-instruct

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

rlzero template and fixes to rlzero scripts
#1216 opened Nov 20, 2025 by mnoukhov Loading…
Adds retries to checkpointing.
#1213 opened Nov 19, 2025 by finbarrtimbers Loading…
Simplify pending query map increments codex
#1207 opened Nov 18, 2025 by finbarrtimbers Loading…
added staleness metrics
#1204 opened Nov 18, 2025 by mnoukhov Loading…
Pad out 32b
#1177 opened Nov 12, 2025 by hamishivi Draft
Changes to make DPO run faster
#1175 opened Nov 11, 2025 by finbarrtimbers Draft
Added NCCL flags from DPO
#1165 opened Nov 10, 2025 by finbarrtimbers Loading…
RL Zero Scripts
#1162 opened Nov 10, 2025 by mnoukhov Draft
Correct loss accumulation for grpo_fast
#1161 opened Nov 10, 2025 by hamishivi Loading…
set to augusta when specified
#1138 opened Nov 3, 2025 by saumyamalik Loading…
Rename grpo_fast.py to grpo.py.
#1133 opened Nov 3, 2025 by finbarrtimbers Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.