Pull requests: confident-ai/deepeval
Pull requests list
chore: bump posthog to 7.x for Python >=3.10 via env markers
#2605
opened Apr 9, 2026 by
sipa-echo-ngbm
fix(tracing): populate token counts and cost for gpt-5.x LLM spans
#2601
opened Apr 4, 2026 by
tiffanychum
3 tasks
feat(test_case): make trace_dict public for post-hoc agentic evaluation
#2600
opened Apr 4, 2026 by
tiffanychum
3 tasks
fix: multi-root traces silently drop root spans from evaluation and export
#2599
opened Apr 4, 2026 by
aerosta
Contributor
fix: preserve expected_outcome in conversational golden conversion
#2598
opened Apr 4, 2026 by
aerosta
Contributor
fix: batched upload permanently truncates in-memory test run
#2597
opened Apr 4, 2026 by
aerosta
Contributor
Add AG2 integration for multi-agent tracing
#2596
opened Apr 3, 2026 by
faridun-ag2
8 tasks done
fix: replace PrivateAttr with @property for Turn._mcp_interaction
#2595
opened Apr 3, 2026 by
bongho
Fix/predictable temp file and race condition in gpu utils vulnerability
#2593
opened Apr 2, 2026 by
AseemPrasad
fixing secure_exec sandbox escape via getattr vulnerability
#2592
opened Apr 2, 2026 by
AseemPrasad
examples: add RAIL Score responsible AI evaluation example
#2591
opened Apr 2, 2026 by
SumitVermakgp
add agent evaluation example with @observe and component-level tracing
#2585
opened Mar 31, 2026 by
Ajay6601
add claude 4.6 models and fix opus 4.5 and updated default anthropic model
#2584
opened Mar 31, 2026 by
Ajay6601
feat: add penalize_ambiguous_claims to AnswerRelevancyMetric
#2573
opened Mar 25, 2026 by
Krishnachaitanyakc
3 tasks
fix(gemini): fix log_probs detection, temperature guard, and add Gemini 3.x models
#2572
opened Mar 23, 2026 by
GerardoYalo
fix(ragas): update capture_metric_type call for new telemetry signature
#2568
opened Mar 22, 2026 by
sachinML
Add GoodMem integration for memory-powered retrieval
#2566
opened Mar 19, 2026 by
bassammalik
4 of 5 tasks
Fix: Parse CSV Single-Turn Golden ToolCalls as JSON objects
#2565
opened Mar 19, 2026 by
seankelley-dt
fix: include tool and trace state in evaluation cache keys
#2561
opened Mar 19, 2026 by
aerosta
Contributor
fix: preserve metric snapshots when async metric tasks fail in indicator
#2560
opened Mar 18, 2026 by
aerosta
Contributor
feat: add native Groq model integration for high-speed evaluations
#2556
opened Mar 17, 2026 by
Jayachander123
4 tasks done
Add tags field to Golden and ConversationalGolden
#2543
opened Mar 9, 2026 by
brian-romain
Contributor