skill-eval — local eval harness for Agent Skills on headless Gemini CLI #25983
fede0089
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I open-sourced
skill-eval, a local eval harness for Agent Skills running on headless Gemini CLI.The problem it tries to solve is simple: skills are easy to demo, but hard to validate reliably.
A skill can look great in one run and still:
skill-evalfocuses on evidence:Repo: [github.com/fede0089/skill-eval](https://github.com/fede0089/skill-eval)
Install:
npm i -g skill-evalWould especially love feedback on what would make this genuinely useful for skill authors
Beta Was this translation helpful? Give feedback.
All reactions