Testing Performance #39

Closed
FloofCat wants to merge 10 commits into liamdugan:main from FloofCat:main

@FloofCat
Contributor

@FloofCat FloofCat commented May 3, 2025

Hi @liamdugan,

We're trying to actively test two of our new frameworks on RAID. Please allow for evaluation as soon as possible!

Thank you!

@github-actions

github-actions Bot commented May 4, 2025

Eval run succeeded! Link to run: link

Here are the results of the submission(s):

divi

Release date: 2025-05-03

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved a TPR of 59.30% at FPR=5%.
Without adversarial attacks, it achieved a TPR of 76.95% at FPR=5%.

divi-pro

Release date: 2025-05-03

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved a TPR of 79.29% at FPR=5%.
Without adversarial attacks, it achieved a TPR of 92.85% at FPR=5%.

If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID!
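
For readers unfamiliar with the metric quoted above, "TPR at FPR=5%" can be illustrated with a minimal sketch: choose a score cutoff so that no more than 5% of human-written texts are flagged, then report the share of machine-generated texts that score above that cutoff. The function name `tpr_at_fpr`, its parameters, and the toy scores are hypothetical, and this is not necessarily how RAID's own evaluation script selects thresholds.

```python
import math


def tpr_at_fpr(human_scores, machine_scores, target_fpr=0.05):
    """Pick a cutoff from human-written scores, then measure TPR on machine text.

    Hypothetical helper: a text is flagged as machine-generated when its
    score is strictly greater than the returned threshold.
    """
    if not human_scores or not machine_scores:
        raise ValueError("both score lists must be non-empty")
    if not 0 <= target_fpr < 1:
        raise ValueError("target_fpr must be in [0, 1)")

    ranked = sorted(human_scores, reverse=True)
    # Largest number of human-written texts we may flag without exceeding the target.
    allowed = min(math.floor(target_fpr * len(ranked)), len(ranked) - 1)
    # Using the (allowed + 1)-th highest human score as the cutoff means at most
    # `allowed` human texts score strictly above it, even when scores tie.
    threshold = ranked[allowed]

    flagged = sum(score > threshold for score in machine_scores)
    return threshold, flagged / len(machine_scores)


# Toy data, not RAID results: 20 human scores and 4 machine scores.
human = [i / 100 for i in range(20)]
machine = [0.10, 0.18, 0.25, 0.90]
threshold, tpr = tpr_at_fpr(human, machine)
print(f"threshold={threshold:.2f}, TPR={tpr:.0%}")
```

The cutoff is taken from the human-written scores alone, so the false positive rate is fixed before any machine-generated text is scored; this is what makes TPR values from different detectors comparable at the same FPR.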

@FloofCat
Contributor Author

FloofCat commented May 6, 2025

Interesting. I have a few other models that I'll be pushing soon for evaluation.

Thanks, please don't push to the leaderboard yet.

@FloofCat
Contributor Author

FloofCat commented May 9, 2025

@liamdugan,

Please allow for evaluation of the same.

@liamdugan
Owner

Yep @FloofCat the bot's comment from earlier was updated with the newly evaluated scores!

@FloofCat
Contributor Author

FloofCat commented Oct 3, 2025

Closing, will open a new PR for updated results soon.

@FloofCat FloofCat closed this Oct 3, 2025