
Feat/eval on dataset#392

Draft
qhua360 wants to merge 4 commits into galilai-group:main from qhua360:feat/eval-on-dataset

Conversation


@qhua360 qhua360 commented Feb 24, 2026

Description

Add a callback that runs evals on arbitrary datasets, along with an adapter function that maps existing callbacks to accepted eval functions.

Working example: https://github.com/galilai-group/clipa/blob/eval-on-dataset-rewrite/clip.py
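The adapter described above can be pictured roughly as follows. `callback_to_evaluator` is the name used in the commits, but the signature, the `.evaluate(...)` method, and the `Evaluator` alias here are illustrative assumptions, not the PR's actual code:

```python
from typing import Any, Callable, Dict

# Hypothetical sketch: callback_to_evaluator wraps an existing callback
# (e.g. CLIPZeroShot, OnlineKNN, OnlineProbe) into a plain evaluator
# function. The .evaluate(model, dataloader) interface is assumed here.
Evaluator = Callable[[Any, Any], Dict[str, float]]

def callback_to_evaluator(callback: Any) -> Evaluator:
    """Adapt an object exposing .evaluate(model, dataloader) into a
    function returning a metrics dict (assumed interface)."""
    def evaluator(model: Any, dataloader: Any) -> Dict[str, float]:
        return callback.evaluate(model, dataloader)
    return evaluator
```

The point of the adapter is that the eval-on-dataset callback only ever sees plain functions, so any existing callback can be reused without inheriting from a common base class.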

Checklist

  • I have read the Contributing document.
  • The documentation is up-to-date with the changes I made (check build artifacts).
  • All tests passed, and additional code has been covered with new tests.
  • I have added the PR to the RELEASES.rst file.

qhua360 and others added 4 commits February 20, 2026 16:59
Add reusable EvalOnDataset callback that runs evaluation functions on
arbitrary datasets every N epochs with DDP support, and a
callback_to_evaluator adapter that wraps existing Lightning callbacks
(CLIPZeroShot, OnlineKNN, OnlineProbe) into evaluator functions.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
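The every-N-epochs gating described in this commit can be sketched in plain Python; the real `EvalOnDataset` is a Lightning callback with DDP support, which is omitted here, and the method name `maybe_evaluate` is a made-up stand-in for the Lightning hook:

```python
from typing import Any, Callable, Dict, List

# Hypothetical sketch of EvalOnDataset's epoch gating, inferred from the
# commit message ("runs evaluation functions ... every N epochs").
class EvalOnDataset:
    def __init__(self, evaluators: List[Callable[[Any], Dict[str, float]]],
                 every_n_epochs: int = 1):
        self.evaluators = evaluators
        self.every_n_epochs = every_n_epochs

    def maybe_evaluate(self, model: Any, epoch: int) -> Dict[str, float]:
        """Run all evaluators only when the (1-based) epoch count hits
        the configured interval; otherwise log nothing."""
        if (epoch + 1) % self.every_n_epochs != 0:
            return {}
        metrics: Dict[str, float] = {}
        for evaluate in self.evaluators:
            metrics.update(evaluate(model))
        return metrics
```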
Replace single name/data/evaluators params with a list of
EvalDatasetEntry dataclasses so one callback handles all eval
runs sequentially with a single DDP barrier, matching the
original ZeroShotEvalCallback behavior.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
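Based on this commit, each entry plausibly bundles the three replaced parameters into one dataclass; the field names and types below are guesses from the mentioned name/data/evaluators params, not the PR's definition:

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

# Hypothetical shape of EvalDatasetEntry, inferred from the commit's
# "single name/data/evaluators params" wording.
@dataclass
class EvalDatasetEntry:
    name: str        # prefix used when logging this dataset's metrics
    data: Any        # dataset or dataloader to evaluate on
    evaluators: List[Callable[..., Dict[str, float]]] = field(default_factory=list)
```

Grouping the entries this way lets one callback iterate over all of them sequentially and hit a single DDP barrier at the end, as the commit describes.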
Let Lightning handle rank coordination and logger dispatch via
log_dict(sync_dist=True) instead of manually checking is_global_zero
and calling trainer.logger.log_metrics.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
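`log_dict(sync_dist=True)` is a real Lightning API: each logged value is reduced across DDP ranks (mean by default) before being dispatched to the logger, which is why the manual `is_global_zero` check becomes unnecessary. A plain-Python illustration of that reduction (not Lightning's implementation):

```python
from statistics import mean
from typing import Dict, List

def sync_dist_mean(per_rank_metrics: List[Dict[str, float]]) -> Dict[str, float]:
    """Illustrative stand-in for the mean reduction that
    log_dict(..., sync_dist=True) applies across DDP ranks."""
    keys = per_rank_metrics[0].keys()
    return {k: mean(m[k] for m in per_rank_metrics) for k in keys}

# e.g. two ranks each holding a partial accuracy over their shard
ranks = [{"val/acc": 0.8}, {"val/acc": 0.6}]
```

Note that a mean over ranks is only exact when every rank sees the same number of samples; with uneven shards the per-sample average can differ slightly.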
