Commit f90822a

Address feedback from PR; move integration test, drop note to performance in the repo itself instead of just PR
1 parent 4a82d3d commit f90822a

2 files changed: +3 -217 lines changed

README.md

Lines changed: 3 additions & 0 deletions
@@ -48,6 +48,9 @@ promptfoo eval -c promptfooconfig.yaml
 
 The eval uses `formfyxer/tests/passive_voice_test_dataset.csv` to ensure sentences labeled as passive produce a non-empty `fragments` array. If you change the prompt inside `formfyxer/passive_voice_detection.py`, copy the updated text (the `system_prompt` and numbering format) into `promptfooconfig.yaml` so the evaluation mirrors runtime behavior.
 
+Raw percentage correct performance using gpt-5 nano on the benchmark dataset is **95.56%.** (Did not
+calculate more specific F1 score).
+
 ## Functions
 
 Functions from `pdf_wrangling` are found on [our documentation site](https://suffolklitlab.org/docassemble-AssemblyLine-documentation/docs/reference/formfyxer/pdf_wrangling).
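
The added README note reports raw percentage correct only and says no F1 score was calculated. As a rough sketch of how both numbers could be derived from the same eval results, the snippet below treats a sentence as "predicted passive" when its `fragments` array is non-empty, which is the pass condition the README describes. The `score` function, the tuple layout, and the example rows are hypothetical and written only for illustration; they are not taken from the repository or from promptfoo.

```python
from typing import Dict, Iterable, List, Tuple


def score(rows: Iterable[Tuple[bool, List[str]]]) -> Dict[str, float]:
    """Compute accuracy, precision, recall, and F1 for passive-voice detection.

    Each row is (labeled_passive, fragments). This layout is an assumption
    made for the sketch, not the repository's actual data format.
    """
    tp = fp = fn = tn = 0
    for labeled_passive, fragments in rows:
        # A non-empty fragments list counts as a "passive" prediction.
        predicted_passive = len(fragments) > 0
        if labeled_passive and predicted_passive:
            tp += 1
        elif labeled_passive and not predicted_passive:
            fn += 1
        elif not labeled_passive and predicted_passive:
            fp += 1
        else:
            tn += 1

    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Made-up rows for illustration only; not drawn from the benchmark dataset.
example_rows = [
    (True, ["was written"]),   # true positive
    (True, []),                # false negative
    (False, []),               # true negative
    (False, ["is done"]),      # false positive
    (True, ["were made"]),     # true positive
]

metrics = score(example_rows)
print(metrics)
```

Reporting F1 next to raw accuracy would matter mostly if the dataset is imbalanced between passive and active sentences, since accuracy alone can hide a weak recall on the smaller class.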

test_passive_voice_detection.py

Lines changed: 0 additions & 217 deletions
This file was deleted.