Skip to content

Reconcile pair labels with 4-criterion vectors across 9 split units#169

Merged
LuminLynx merged 1 commit into
mainfrom
claude/label-reconciliation
May 21, 2026
Merged

Reconcile pair labels with 4-criterion vectors across 9 split units#169
LuminLynx merged 1 commit into
mainfrom
claude/label-reconciliation

Conversation

@LuminLynx
Copy link
Copy Markdown
Owner

Summary

Cleanup-only PR. After the c2-split rollout renumbered each unit's rubric from 3 → 4 criteria, many human-readable pair label: strings in the regression sets still described the pre-split layout — reading "all three met" or naming the old c1/c2/c3 positions. Those labels contradicted the actual post-split expected.criteria met/missing vectors (the exact class of inconsistency Codex flagged on the Unit 8 review).

This reconciles the labels for the 9 merged split units (2, 3, 4, 5, 6, 7, 10, 11, 12) so every label's criteria summary matches its 4-criterion vector.

  • 85 label lines rewritten across 9 regression sets.
  • Zero changes to any expected.criteria value or flagged flag — text only (diff is 85 insertions / 85 deletions, all on label: lines).
  • Includes the last straggler: latency p017, a [ttft] pseudocode edge case still labeled "all three met" → now c1 + c2 + c4 met, c3 missing.

Test plan

  • run_regression_set.py --check passes on all 9 bundles
  • git diff confirms only label: lines changed (no expected-value or flagged edits)
  • No remaining label: line claims "all three met" against a 4-criterion vector

https://claude.ai/code/session_019xEvNkByf5ic4kbMZFdKDR


Generated by Claude Code

Rewrites stale human-readable pair labels in the regression sets for the
9 merged c2-split units (2,3,4,5,6,7,10,11,12) so each label's criteria
summary matches its post-split 4-criterion expected vector. Many labels
still read "all three met" or named the pre-split c1/c2/c3 layout, which
contradicted the actual met/missing positions after the c2-split renumber.

Label text only — zero changes to any expected.criteria value or flagged
flag. All 9 sets pass --check schema validation.

https://claude.ai/code/session_019xEvNkByf5ic4kbMZFdKDR
@LuminLynx LuminLynx merged commit 4d9365d into main May 21, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants