Fix off‑by‑one when using --max_examples in language/bert/evaluate_v1.1.py #2266

sbite0138 · 2025-07-18T13:18:55Z

The script provides a --max_examples option to limit the number of evaluation samples.
However, total is incremented and compared to max_examples before the per‑example score is recorded.
If --max_examples is specified, the statistics for the final sample are skipped, so the reported metrics are based on N – 1 examples instead of N.

This PR moves the total += 1 and the if max_examples and max_examples == total: break logic after the score update. This lets the last sample contribute to the metrics, ensuring the results accurately reflect all N examples.

github-actions · 2025-07-18T13:19:04Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

sbite0138 · 2025-07-18T14:44:19Z

recheck

Fix counter increment position in BERT evaluation script

c6ce1c7

sbite0138 requested a review from a team as a code owner July 18, 2025 13:18

Merge branch 'master' into fix-bert-evaluation-counter

6f46fef

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix off‑by‑one when using --max_examples in language/bert/evaluate_v1.1.py #2266

Fix off‑by‑one when using --max_examples in language/bert/evaluate_v1.1.py #2266

Uh oh!

sbite0138 commented Jul 18, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 18, 2025 •

edited

Loading

Uh oh!

sbite0138 commented Jul 18, 2025

Uh oh!

Uh oh!

Fix off‑by‑one when using --max_examples in language/bert/evaluate_v1.1.py #2266

Are you sure you want to change the base?

Fix off‑by‑one when using --max_examples in language/bert/evaluate_v1.1.py #2266

Uh oh!

Conversation

sbite0138 commented Jul 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jul 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sbite0138 commented Jul 18, 2025

Uh oh!

Uh oh!

Fix off‑by‑one when using --max_examples in language/bert/evaluate_v1.1.py #2266

Fix off‑by‑one when using --max_examples in language/bert/evaluate_v1.1.py #2266

sbite0138 commented Jul 18, 2025 •

edited

Loading

github-actions bot commented Jul 18, 2025 •

edited

Loading