@bodsul bodsul commented May 2, 2025

Add evaluation for the newly released CAPTURe dataset. arXiv preprint: https://arxiv.org/abs/2504.15485, repo: https://github.com/atinpothiraj/CAPTURe.

Opened an issue regarding prompt choice and the use of Llama-3.1-8B-Instruct to extract answers here. Will move the PR out of WIP once these issues are clarified.

@kennymckormick
Member

Hi, @bodsul ,

Maybe you can try to re-implement it using an API model (such as gpt-4o or gpt-4.1-mini). If you don't have credits for the GPT API, you can just implement it in this PR and I'll help check the results.

@atinpothiraj

atinpothiraj commented May 30, 2025

Added my comments to the issue in the original repo; lmk if any other clarification is needed.
