Releases: Corbell-AI/evalmonkey
v1.0.1
09 May 06:22
What's Changed
feat: add framework adapters for LangGraph, LlamaIndex, and PydanticAI by @himmi-01 in #4
Full Changelog: v1.0.0...v1.0.1
v1.0.0
06 May 05:56
v0.1.3
03 May 20:14
implement automated eval asset generation and improvement prompts for failed benchmark traces ea99606
optimize benchmark loading by enabling streaming mode and add testing/inspection utilities 88a8fb5
strip markdown code fences from LLM judge responses 46be866
remove webarena benchmark 179d5ab
v0.1.2
26 Apr 01:15
v0.1.1
21 Apr 02:08
Let users bring their own benchmark dataset and allow running all chaos tests via single CLI command 88fecf8
v0.1.0
18 Apr 22:52
EvalMonkey's first release with MCP server support.
8 agent frameworks supported
10 off-the-shelf benchmarks supported
7 chaos scenarios supported
Historical benchmark data in the TUI