v19 checklist-driven fresh audit: CLAUDE.md + last 5,930/7,612 drifts#35
Closed
lucapinello wants to merge 1 commit intochorus-applicationsfrom
Closed
v19 checklist-driven fresh audit: CLAUDE.md + last 5,930/7,612 drifts#35lucapinello wants to merge 1 commit intochorus-applicationsfrom
lucapinello wants to merge 1 commit intochorus-applicationsfrom
Conversation
First audit walked against the 18-section checklist shipped in PRs #33 + #34. Artefacts land in audits/2026-04-21_v19_fresh_audit/ per the checklist appendix: 16 selenium-rendered screenshots (18 HTMLs; 2 basename collisions across dirs), CDF sanity output, per-env device probe, consistency greps, Python API probe, determinism check, pytest log. ## CLAUDE.md at repo root Points every future Claude Code session at audits/AUDIT_CHECKLIST.md before any "ship-ready" pass, so the checklist doesn't drift by memory. Also mentions the audits/ chronology for cumulative context. ## Fixes — the last two `5,930` / `7,612` in live code 1. chorus/oracles/alphagenome.py:22 — AlphaGenomeOracle docstring said "5,930 human functional genomic tracks"; real count is 5,731 (matches v16/v17/v18 fixes to notebooks, README, server.py, metadata). This was the last 5,930 in the live codebase. 2. scripts/build_backgrounds_borzoi.py:4 — module docstring said "all 7,612 Borzoi tracks"; real count is 7,611 (matches v17 fix to scripts/README.md). ## Checklist results (PASS unless noted) - §3 GPU/device matrix: all 6 envs detect Metal/MPS on macOS arm64. - §4 CDF sanity: monotonic effect_cdfs + p50<=p95<=p99 + signed% all match expected semantics for all 6 oracles. - §5 Python API: sequence_length matches spec for all 6; invalid oracle name gives helpful ValueError; ModelNotLoadedError clear. - §7 HTML reports: 18/18 shipped HTMLs render via selenium with 0 JS console errors. Screenshots committed. - §10 consistency: 2 stale 5,930/7,612 drifts — fixed here. - §11 pytest: 334 passed / 1 skipped (9m). - §13 determinism (mock): PASS; real-oracle check deferred to release host. - §15 offline: 0 runtime CDN fetches (greps for <script src="http…" and <link href="http…" empty across all 18 HTMLs). - §16 logging hygiene: no committed HF tokens or AWS keys. - §18 LICENSE present (MIT, Pinello Lab). ## Flagged, NOT fixed here - §18 No NOTICE / docs/THIRD_PARTY.md attributing the 6 oracle models or the bundled IGV.js. P1 for release. - §17 pip-audit not in base env. Add to environment.yml dev deps or run as release-host CI step. - §1 & §14 (fresh install + genomics edge cases) need a release host with ~80 GB disk and hours of runtime — belong in the release audit. Full report at audits/2026-04-21_v19_fresh_audit/report.md. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Contributor
Author
|
Superseded by other agent's v19/v20 work which landed most of the same fixes (5,731 docstring, CLAUDE.md, checklist additions) with a better CLAUDE.md. Cherry-picking the remaining one-line 7,612→7,611 fix into a new PR along with a bug fix for the §14.4 KeyError('H') finding. |
4 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
First audit walked against the 18-section runbook in
audits/AUDIT_CHECKLIST.md(shipped in PRs #33 + #34). Leaves behind the full artefact bundle the checklist appendix asks for.Durable guidance —
CLAUDE.mdat repo rootNew top-level
CLAUDE.mdtells every future Claude Code session to readaudits/AUDIT_CHECKLIST.mdfirst before any "ship-ready" pass. Walks should top-to-bottom against the checklist and drop artefacts inaudits/YYYY-MM-DD_vNN_<label>/.Fixes — the last two stale canonical numbers in live code
chorus/oracles/alphagenome.py:22—AlphaGenomeOracledocstring said "5,930 human functional genomic tracks"; real count is 5,731. Matches v16/v17/v18 fixes to notebooks, README, MCP server, metadata. This was the last5,930anywhere live.scripts/build_backgrounds_borzoi.py:4— module docstring said "all 7,612 Borzoi tracks"; real count is 7,611. Matches v17 fix toscripts/README.md.Checklist results (PASS unless noted)
p50≤p95≤p99+ signed% match semantics for all 6 oraclessequence_lengthmatches spec; errors clear<script src="http…">/<link href="http…">greps empty; apparent "CDN refs" earlier were attribution comments inside bundled IGV.js)LICENSEpresent (MIT, Pinello Lab)Known issues flagged, NOT fixed here
NOTICE/docs/THIRD_PARTY.md— 6 oracle models and bundled IGV.js should be attributed in one reachable place. P1 for release.pip-auditnot in base env — add toenvironment.ymldev deps or run as release-host CI step.Artefacts in
audits/2026-04-21_v19_fresh_audit/report.md— this summaryscreenshots/*.png(16) — selenium-rendered HTML reports at 1600×4500cdf_check.txt,device_probe.txt,python_api.txt,determinism.txt,consistency_grep.txt,cdn_runtime.txt,pip_audit.txt,pytest.txtFuture audits diff against these.
Test plan
pytest tests/ --ignore=tests/test_smoke_predict.py -q→ 334 passed / 1 skipped (9m)grep '5,930\|7,612' --include='*.md' --include='*.py' -r .after fixes → empty (live code only;audits/historical snapshots excluded)🤖 Generated with Claude Code