Skip to content

Conversation

reneenoble
Copy link

Update prompt info for system.

@reneenoble
Copy link
Author

/evaluate

Copy link

Starting evaluation! Check the Actions tab for progress, or wait for a comment with the results.

Copy link

metric stat baseline pr113
gpt_groundedness pass_rate 1.0 0.0
mean_rating 5.0 1.0
gpt_relevance pass_rate 1.0 0.0
mean_rating 5.0 1.0
answer_length mean 978.9 230.0
latency mean 2.51 1.23
citation_match rate 1.0 1.0
num_questions total 10 10

@pamelafox pamelafox closed this Oct 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants