feat: Terminal-Bench 2.0 harness, Pi agent wrapper, and vox bench subcommand #2
Draft
Copilot wants to merge 3 commits into