Evidence-first structured scoring for AI artifacts: synthesize a rubric, collect citations, score only against quoted evidence, and hedge where evidence is thin. Every dimension score ties to a file:line reference or a quote and comes with reproducibility receipts. Built for papers, PRs, prompts, and emails, where a defensible LLM-backed score beats a headline number. Supports 7 backends.
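As a rough illustration of the "score only against quoted evidence, hedge on thin evidence" idea, here is a minimal sketch. It is not the project's actual API: the names `Citation`, `DimensionScore`, `score_dimension`, the `min_citations` threshold, and the confidence labels are all hypothetical, chosen only to show the shape of the rule.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Citation:
    location: str  # hypothetical "file:line" style reference
    quote: str     # verbatim text the score relies on


@dataclass
class DimensionScore:
    name: str
    score: Optional[float]  # None when nothing was quoted
    confidence: str         # "firm", "hedged", or "no-evidence" (assumed labels)
    citations: List[Citation]


def score_dimension(name: str, citations: List[Citation],
                    raw_score: float, min_citations: int = 2) -> DimensionScore:
    """Attach a score to a rubric dimension only if quoted evidence backs it."""
    # Discard citations whose quote is empty or whitespace only.
    usable = [c for c in citations if c.quote.strip()]
    if not usable:
        # No quoted evidence: refuse to emit a number at all.
        return DimensionScore(name, None, "no-evidence", [])
    # Thin evidence (fewer than min_citations quotes) is marked as hedged.
    confidence = "firm" if len(usable) >= min_citations else "hedged"
    return DimensionScore(name, raw_score, confidence, usable)


# Example: one supporting quote is enough to score, but the result is hedged.
result = score_dimension(
    "clarity",
    [Citation("README.md:3", "Install with pip.")],
    raw_score=4.0,
)
print(result.confidence, result.score)
```

The design choice sketched here is that a missing quote yields no score rather than a low score, so a reader can tell "unsupported" apart from "poor".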
Updated May 27, 2026 · Python