Skip to content

Pull requests: eval-protocol/python-sdk

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

add logger to typescript package
#332 opened Nov 15, 2025 by xzrderek Loading…
vision food reasoning eval
#331 opened Nov 14, 2025 by benjibc Loading…
Update Klavis MCP use case
#330 opened Nov 14, 2025 by LLiuZheng Loading…
Text to SQL RFT example
#324 opened Nov 10, 2025 by benjibc Loading…
added model quality gha
#319 opened Nov 6, 2025 by shreymodi1 Loading…
18 tasks
swe-bench
#280 opened Oct 15, 2025 by shreymodi1 Loading…
reasoning effort string change
#267 opened Oct 10, 2025 by shreymodi1 Loading…
18 tasks
reuse pydantic example for local model picking
#251 opened Oct 5, 2025 by benjibc Loading…
pyyaml removal step 1
#247 opened Oct 3, 2025 by benjibc Loading…
directly hit enter to select
#245 opened Oct 2, 2025 by benjibc Loading…
auto convert from dict
#239 opened Sep 30, 2025 by mayinghan Loading…
18 tasks
Route benchmark datasets through data loaders codex
#229 opened Sep 27, 2025 by benjibc Loading…
Eval agent
#200 opened Sep 20, 2025 by dphuang2 Loading…
Fix type errors and enable pre commit
#155 opened Sep 3, 2025 by benjibc Loading…
10 of 16 tasks
fix decorator wrap for sync function
#105 opened Aug 20, 2025 by mayinghan Loading…
[WIP] Support fireworks login
#103 opened Aug 19, 2025 by benjibc Draft
[DRAFT] PR AUC
#98 opened Aug 19, 2025 by benjibc Loading…
reward bench 2 reimplementation
#90 opened Aug 18, 2025 by benjibc Draft
rename files and consolidate aime
#72 opened Aug 13, 2025 by benjibc Loading…
speed up import
#39 opened Aug 8, 2025 by benjibc Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.