Releases · Corbell-AI/evalmonkey

09 May 06:22

himmi-01

v1.0.1

d91c46d

v1.0.1 Latest

Latest

What's Changed

feat: add framework adapters for LangGraph, LlamaIndex, and PydanticAI by @himmi-01 in #4

Full Changelog: v1.0.0...v1.0.1

Contributors

himmi-01

Assets 2

06 May 05:56

himmi-01

v1.0.0

1ad2b0e

v1.0.0

What's Changed

feat: evalmonkey web ui and benchmark stability fixes by @himmi-01 in #3

Full Changelog: v0.1.3...v1.0.0

Contributors

himmi-01

Assets 2

03 May 20:14

himmi-01

v0.1.3

82a5925

v0.1.3

implement automated eval asset generation and improvement prompts for failed benchmark traces ea99606
optimize benchmark loading by enabling streaming mode and add testing/inspection utilities 88a8fb5
strip markdown code fences from LLM judge responses 46be866
remove webarena benchmark 179d5ab