Fix set-zip misalignment in PerClassScorer.__call__ by Chessing234 · Pull Request #571 · allenai/scispacy

Chessing234 · 2026-04-13T13:53:02Z

Bug

In PerClassScorer.__call__(), untyped_predicted_spans is built as a set comprehension (line 21), then zipped with predicted_spans (a list) on line 23. Since Python sets are unordered, the i-th element yielded by iterating the set does not correspond to the i-th element of the list. This means untyped_span and span in each loop iteration refer to unrelated predictions, corrupting both the typed and untyped precision/recall/F1 metrics.

A secondary issue: if multiple predicted spans share the same (start, end) but differ in label, the set deduplicates them, making it shorter than the list. zip silently stops at the shorter iterable, so some predictions are never evaluated.
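Both failure modes can be reproduced outside the scorer. The snippet below is a minimal sketch, not code from scispacy: it assumes spans are `(start, end, label)` tuples, and the sample spans are made up for illustration. The set-versus-list length mismatch is deterministic. Which pairs end up misaligned depends on the set's iteration order, so that part is printed rather than asserted.

```python
# Hypothetical predictions, assumed to be (start, end, label) tuples.
predicted_spans = [(0, 5, "GENE"), (0, 5, "DISEASE"), (10, 15, "CHEMICAL")]

# Same construction as described above: a set of (start, end) pairs.
untyped_predicted_spans = {(start, end) for start, end, _ in predicted_spans}

# zip stops at the shorter iterable, so one typed prediction is dropped.
pairs = list(zip(untyped_predicted_spans, predicted_spans))

# Pairs whose untyped half does not belong to the typed half.
# The contents depend on set iteration order, which is not guaranteed.
misaligned = [(u, s) for u, s in pairs if u != (s[0], s[1])]

print(len(predicted_spans), len(untyped_predicted_spans), len(pairs))
print(misaligned)
```

Here the two predictions covering `(0, 5)` collapse into one set element, so the loop would visit only two of the three predictions.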

Root cause

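Line 23 assumes that iterating the set yields elements in the same order as the list. Python sets make no ordering guarantee, and deduplication can also leave the set shorter than the list.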

Fix

Derive untyped_span directly from span inside the loop body instead of zipping with the set. This guarantees the untyped version always corresponds to the correct typed prediction.
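The shape of the fix can be sketched as follows. This is an illustrative stand-alone function, not the actual `PerClassScorer.__call__` body: the name `score_spans`, the `(start, end, label)` tuple layout, and the simplified true/false positive bookkeeping are all assumptions made for the example. The point it shows is that `untyped_span` is computed from `span` inside the loop, so the two always describe the same prediction and every prediction is visited.

```python
from collections import defaultdict


def score_spans(predicted_spans, gold_spans):
    """Hypothetical helper: count typed and untyped matches per prediction."""
    gold_spans = list(gold_spans)  # copy, since matched gold spans are consumed
    untyped_gold_spans = {(start, end) for start, end, _ in gold_spans}

    true_positives = defaultdict(int)
    false_positives = defaultdict(int)

    for span in predicted_spans:
        # Derive the untyped span from the typed one: no zip, no set ordering.
        untyped_span = (span[0], span[1])

        if span in gold_spans:
            true_positives[span[2]] += 1
            gold_spans.remove(span)
        else:
            false_positives[span[2]] += 1

        if untyped_span in untyped_gold_spans:
            true_positives["untyped"] += 1
            untyped_gold_spans.remove(untyped_span)
        else:
            false_positives["untyped"] += 1

    return dict(true_positives), dict(false_positives)


predicted = [(0, 5, "GENE"), (0, 5, "DISEASE"), (10, 15, "CHEMICAL")]
gold = [(0, 5, "GENE"), (10, 15, "CHEMICAL")]
tp, fp = score_spans(predicted, gold)
print(tp, fp)
```

With this structure, the two predictions that share `(0, 5)` are both evaluated: the first counts as an untyped match and the second as an untyped false positive, instead of one of them being silently skipped.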

untyped_predicted_spans is a set (unordered), so zipping it with predicted_spans (a list) pairs unrelated spans together. Derive untyped_span directly from span inside the loop instead.

Fix set-zip misalignment in PerClassScorer.__call__

63c3eb1

Chessing234 wants to merge 1 commit into allenai:main from Chessing234:fix/per-class-scorer-set-zip-misalignment