feat: Turn report_metrics of ComparisonReport into Displays #1520

MarieSacksick · 2025-04-04T14:33:19Z

As part of issue #1782, turn the report_metrics of ComparisonReport into a display. The plot of this display will be a pair plot to visualize one metric against another.

Add documentation.

This is part of the narrative "I have several models, how can I choose the best one?". The user will need a comparison report and plots to compare.

Blocked by #1788.

github-actions · 2025-04-04T14:37:57Z

Coverage Report for backend

File	Stmts	Miss	Cover	Missing
venv/lib/python3.12/site-packages/skore
__init__.py	22	0	100%
_config.py	28	0	100%
exceptions.py	4	4	0%	4–23
venv/lib/python3.12/site-packages/skore/persistence
__init__.py	0	0	100%
venv/lib/python3.12/site-packages/skore/persistence/item
__init__.py	55	1	98%	97
altair_chart_item.py	19	1	91%	14
item.py	22	1	95%	86
matplotlib_figure_item.py	36	1	95%	19
media_item.py	22	0	100%
numpy_array_item.py	27	1	94%	16
pandas_dataframe_item.py	29	1	94%	14
pandas_series_item.py	29	1	94%	14
pickle_item.py	22	0	100%
pillow_image_item.py	25	1	93%	15
plotly_figure_item.py	20	1	92%	14
polars_dataframe_item.py	27	1	94%	14
polars_series_item.py	22	1	92%	14
primitive_item.py	23	2	91%	13–15
sklearn_base_estimator_item.py	29	1	94%	15
venv/lib/python3.12/site-packages/skore/persistence/repository
__init__.py	2	0	100%
item_repository.py	59	5	91%	15–16, 202–203, 226
venv/lib/python3.12/site-packages/skore/persistence/storage
__init__.py	4	0	100%
abstract_storage.py	22	0	100%
disk_cache_storage.py	33	1	95%	44
in_memory_storage.py	20	0	100%
venv/lib/python3.12/site-packages/skore/project
__init__.py	2	0	100%
project.py	84	2	98%	282, 394
venv/lib/python3.12/site-packages/skore/sklearn
__init__.py	6	0	100%
_base.py	170	14	92%	44, 57, 125, 128, 181–190, 202–>208, 223, 226–227
find_ml_task.py	61	0	99%	136–>145
types.py	13	2	85%	34, 62
venv/lib/python3.12/site-packages/skore/sklearn/_comparison
__init__.py	5	0	100%
metrics_accessor.py	167	2	98%	166, 167–>169, 1281
report.py	81	13	80%	18, 252–>255, 401–431
venv/lib/python3.12/site-packages/skore/sklearn/_cross_validation
__init__.py	5	0	100%
metrics_accessor.py	180	0	99%	144–>146, 146–>148
report.py	110	1	98%	23
venv/lib/python3.12/site-packages/skore/sklearn/_estimator
__init__.py	7	0	100%
feature_importance_accessor.py	133	0	99%	485–>491, 571–>580
metrics_accessor.py	345	10	96%	170–179, 207–>216, 215, 245, 256–>258, 286, 313–317, 332, 367, 368–>370
report.py	143	1	98%	24, 253–>255
venv/lib/python3.12/site-packages/skore/sklearn/_plot
__init__.py	2	0	100%
base.py	6	0	100%
style.py	28	0	100%
utils.py	122	5	95%	51, 75–77, 81
venv/lib/python3.12/site-packages/skore/sklearn/_plot/metrics
__init__.py	4	0	100%
precision_recall_curve.py	170	1	99%	656
prediction_error.py	162	0	100%
roc_curve.py	173	1	99%	646
venv/lib/python3.12/site-packages/skore/sklearn/train_test_split
__init__.py	0	0	100%
train_test_split.py	51	1	96%	16, 154–>158
venv/lib/python3.12/site-packages/skore/sklearn/train_test_split/warning
__init__.py	8	0	100%
high_class_imbalance_too_few_examples_warning.py	17	1	90%	79
high_class_imbalance_warning.py	18	0	100%
random_state_unset_warning.py	12	1	88%	15
shuffle_true_warning.py	10	1	83%	46
stratify_is_set_warning.py	12	1	88%	15
time_based_column_warning.py	23	2	86%	17, 73
train_test_split_warning.py	5	1	80%	21
venv/lib/python3.12/site-packages/skore/utils
__init__.py	6	0	100%
_accessor.py	42	1	96%	94
_environment.py	27	0	97%	30–>35
_fixes.py	8	0	100%
_index.py	5	0	100%
_logger.py	22	4	85%	15–19
_measure_time.py	10	0	100%
_parallel.py	38	3	88%	23–33, 124
_patch.py	13	5	53%	21–37
_progress_bar.py	34	0	100%
_show_versions.py	33	2	95%	65–66
TOTAL	3174	99	96%

Tests	Skipped	Failures	Errors	Time
793	8 💤	0 ❌	0 🔥	58.266s ⏱️

github-actions · 2025-04-04T14:42:51Z

Documentation preview @ a63dd79

glemaitre · 2025-04-07T13:13:23Z

From speaking with @GaelVaroquaux, it looks like this plot should be more general: it should be a pairwise plot (comparing two scores). Time vs. score is probably a good default, but we should make it possible to change the pair via parameters.

github-actions · 2025-04-25T09:25:37Z

Coverage Report for backend

File	Stmts	Miss	Cover	Missing
venv/lib/python3.12/site-packages/skore
__init__.py	22	0	100%
_config.py	28	0	100%
exceptions.py	4	4	0%	4–23
venv/lib/python3.12/site-packages/skore/persistence
__init__.py	0	0	100%
venv/lib/python3.12/site-packages/skore/persistence/item
__init__.py	55	1	98%	97
altair_chart_item.py	19	1	91%	14
item.py	22	1	95%	86
matplotlib_figure_item.py	36	1	95%	19
media_item.py	22	0	100%
numpy_array_item.py	27	1	94%	16
pandas_dataframe_item.py	29	1	94%	14
pandas_series_item.py	29	1	94%	14
pickle_item.py	22	0	100%
pillow_image_item.py	25	1	93%	15
plotly_figure_item.py	20	1	92%	14
polars_dataframe_item.py	27	1	94%	14
polars_series_item.py	22	1	92%	14
primitive_item.py	23	2	91%	13–15
sklearn_base_estimator_item.py	29	1	94%	15
venv/lib/python3.12/site-packages/skore/persistence/repository
__init__.py	2	0	100%
item_repository.py	59	5	91%	15–16, 202–203, 226
venv/lib/python3.12/site-packages/skore/persistence/storage
__init__.py	4	0	100%
abstract_storage.py	22	0	100%
disk_cache_storage.py	33	1	95%	44
in_memory_storage.py	20	0	100%
venv/lib/python3.12/site-packages/skore/project
__init__.py	2	0	100%
project.py	83	2	98%	280, 392
venv/lib/python3.12/site-packages/skore/sklearn
__init__.py	6	0	100%
_base.py	171	14	92%	45, 58, 126, 129, 182–191, 203–>209, 224, 227–228
find_ml_task.py	61	0	99%	136–>145
types.py	13	0	100%
venv/lib/python3.12/site-packages/skore/sklearn/_comparison
__init__.py	5	0	100%
metrics_accessor.py	165	2	97%	163, 164–>166, 1278
report.py	81	13	80%	18, 250–>253, 399–429
venv/lib/python3.12/site-packages/skore/sklearn/_cross_validation
__init__.py	5	0	100%
metrics_accessor.py	190	0	99%	153–>155, 155–>157
report.py	110	1	98%	23
venv/lib/python3.12/site-packages/skore/sklearn/_estimator
__init__.py	7	0	100%
feature_importance_accessor.py	133	0	99%	483–>489, 569–>578
metrics_accessor.py	344	10	96%	174–183, 211–>220, 219, 249, 260–>262, 290, 317–321, 336, 371, 372–>374
report.py	148	1	98%	24, 253–>255
venv/lib/python3.12/site-packages/skore/sklearn/_plot
__init__.py	2	0	100%
base.py	6	0	100%
style.py	28	0	100%
utils.py	122	5	95%	51, 75–77, 81
venv/lib/python3.12/site-packages/skore/sklearn/_plot/metrics
__init__.py	4	0	100%
precision_recall_curve.py	173	1	99%	660
prediction_error.py	164	0	100%
roc_curve.py	176	1	99%	649
venv/lib/python3.12/site-packages/skore/sklearn/train_test_split
__init__.py	0	0	100%
train_test_split.py	51	1	96%	16, 154–>158
venv/lib/python3.12/site-packages/skore/sklearn/train_test_split/warning
__init__.py	8	0	100%
high_class_imbalance_too_few_examples_warning.py	17	1	90%	79
high_class_imbalance_warning.py	18	0	100%
random_state_unset_warning.py	12	1	88%	15
shuffle_true_warning.py	10	1	83%	46
stratify_is_set_warning.py	12	1	88%	15
time_based_column_warning.py	23	2	86%	17, 73
train_test_split_warning.py	4	0	100%
venv/lib/python3.12/site-packages/skore/utils
__init__.py	6	0	100%
_accessor.py	46	1	97%	102
_environment.py	27	0	97%	30–>35
_fixes.py	8	0	100%
_index.py	5	0	100%
_logger.py	22	4	85%	15–19
_measure_time.py	10	0	100%
_parallel.py	38	3	88%	23–33, 124
_patch.py	13	5	53%	21–37
_progress_bar.py	36	0	100%
_show_versions.py	33	0	100%
TOTAL	3199	94	96%

Tests	Skipped	Failures	Errors	Time
814	8 💤	0 ❌	0 🔥	54.175s ⏱️

github-actions · 2025-05-13T15:41:22Z

Coverage Report for skore/

File	Stmts	Miss	Cover	Missing
venv/lib/python3.12/site-packages/skore
__init__.py	23	0	100%
_config.py	28	0	100%
exceptions.py	4	4	0%	4, 15, 19, 23
venv/lib/python3.12/site-packages/skore/project
__init__.py	2	0	100%
metadata.py	67	0	100%
project.py	43	0	100%
reports.py	11	0	100%
widget.py	138	5	96%	375–377, 447–448
venv/lib/python3.12/site-packages/skore/sklearn
__init__.py	6	0	100%
_base.py	169	14	91%	45, 58, 126, 129, 182, 185–186, 188–191, 224, 227–228
find_ml_task.py	61	0	100%
types.py	26	1	96%	26
utils.py	1	0	100%
venv/lib/python3.12/site-packages/skore/sklearn/_comparison
__init__.py	5	0	100%
metrics_accessor.py	203	3	98%	170, 334, 1288
report.py	98	0	100%
utils.py	55	0	100%
venv/lib/python3.12/site-packages/skore/sklearn/_cross_validation
__init__.py	5	0	100%
metrics_accessor.py	211	1	99%	334
report.py	125	1	99%	480
venv/lib/python3.12/site-packages/skore/sklearn/_estimator
__init__.py	7	0	100%
feature_importance_accessor.py	143	2	98%	216–217
metrics_accessor.py	382	9	97%	162, 191, 193, 200, 291, 360, 364, 379, 414
report.py	166	2	98%	454–455
venv/lib/python3.12/site-packages/skore/sklearn/_plot
__init__.py	2	0	100%
base.py	5	0	100%
style.py	28	0	100%
utils.py	118	5	95%	50, 74–76, 80
venv/lib/python3.12/site-packages/skore/sklearn/_plot/metrics
__init__.py	6	0	100%
confusion_matrix.py	69	4	94%	90, 98, 120, 228
pair_plot.py	67	8	88%	54, 57, 89, 150, 164, 174, 178, 219
precision_recall_curve.py	230	1	99%	716
prediction_error.py	160	0	100%
roc_curve.py	242	4	98%	380, 497, 598, 791
venv/lib/python3.12/site-packages/skore/sklearn/train_test_split
__init__.py	0	0	100%
train_test_split.py	49	0	100%
venv/lib/python3.12/site-packages/skore/sklearn/train_test_split/warning
__init__.py	8	0	100%
high_class_imbalance_too_few_examples_warning.py	17	1	94%	80
high_class_imbalance_warning.py	18	0	100%
random_state_unset_warning.py	10	0	100%
shuffle_true_warning.py	10	1	90%	46
stratify_is_set_warning.py	10	0	100%
time_based_column_warning.py	21	1	95%	73
train_test_split_warning.py	4	0	100%
venv/lib/python3.12/site-packages/skore/utils
__init__.py	6	2	66%	8, 13
_accessor.py	52	2	96%	67, 108
_environment.py	27	0	100%
_fixes.py	8	0	100%
_index.py	5	0	100%
_logger.py	22	4	81%	15–17, 19
_measure_time.py	10	0	100%
_parallel.py	38	3	92%	23, 33, 124
_patch.py	13	5	61%	21, 23–24, 35, 37
_progress_bar.py	45	0	100%
_show_versions.py	33	2	93%	65–66
_testing.py	37	0	100%
TOTAL	3349	85	97%

Tests	Skipped	Failures	Errors	Time
830	5 💤	0 ❌	0 🔥	1m 1s ⏱️

thomass-dev · 2025-05-26T10:31:15Z

[automated comment] Please update your PR with main, so that the pytest workflow status will be reported.

glemaitre

I think we should settle the API question before going further, since it affects how the display is created.

If we have a display for report_metrics, it only means that we share a common display, and the kind parameter decides which representation to show.

glemaitre · 2025-05-27T11:30:04Z

skore/src/skore/sklearn/_plot/metrics/pair_plot.py

@@ -0,0 +1,220 @@
+import matplotlib.pyplot as plt
+
+from skore.sklearn._plot.base import Display


Display is a Python protocol (https://peps.python.org/pep-0544/), so we don't need to inherit from it.
It only specifies the methods that an object must implement for isinstance(obj, Display) to succeed.
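
To illustrate the point about protocols, here is a minimal sketch of structural typing with `typing.Protocol`. The `Display` class below is a hypothetical stand-in, assumed to be decorated with `runtime_checkable`; it is not the actual definition from `skore.sklearn._plot.base`, and `ToyPairPlotDisplay` is likewise illustrative only.

```python
from typing import Any, Protocol, runtime_checkable


@runtime_checkable
class Display(Protocol):
    """Hypothetical stand-in for the protocol; the real one may declare more methods."""

    def plot(self, **kwargs: Any) -> None: ...


class ToyPairPlotDisplay:
    """Does not inherit from Display, yet satisfies it structurally."""

    def plot(self, **kwargs: Any) -> None:
        print("plotting")


class NotADisplay:
    """Has no plot method, so it does not satisfy the protocol."""


# isinstance only checks that the required methods are present.
is_display = isinstance(ToyPairPlotDisplay(), Display)
is_not_display = isinstance(NotADisplay(), Display)
```

With a runtime-checkable protocol, `isinstance` inspects method names only, not signatures, so the check is a lightweight contract rather than a full type check.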

Suggested change

-from skore.sklearn._plot.base import Display

glemaitre · 2025-05-27T11:33:37Z

skore/src/skore/sklearn/_plot/metrics/pair_plot.py

+        self.ax_ = None
+        self.text_ = None
+
+    def plot(self, ax=None, **kwargs):


This is the method that should get the style decorator.

Suggested change

-    def plot(self, ax=None, **kwargs):
+    @StyleDisplayMixin.style_plot
+    def plot(self, ax=None, **kwargs):

We don't need ax anymore: we decided with @auguste-probabl to reduce the API here.

You can also remove kwargs because it is unused.

glemaitre · 2025-05-27T11:44:39Z

skore/src/skore/sklearn/_plot/metrics/pair_plot.py

+    @classmethod
+    def from_metrics(
+        cls,
+        metrics,
+        perf_metric_x,
+        perf_metric_y,
+        data_source=None,
+    ):


The display should not expose this public function. The idea is that the reports are the only objects that can create an instance of a Display.

You can have a look at RocCurveDisplay (or PrecisionRecallDisplay). Basically, I think we should keep the name _compute_data_for_display; however, we can adapt the input parameters.
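
A minimal sketch of that pattern, under stated assumptions: `ToyPairPlotDisplay` and `ToyComparisonReport` are hypothetical classes, and the signature given to `_compute_data_for_display` here is invented for illustration. It is not the signature used by RocCurveDisplay.

```python
class ToyPairPlotDisplay:
    """Hypothetical display; holds only the data needed to draw the plot."""

    def __init__(self, *, x, y, labels):
        self.x = x
        self.y = y
        self.labels = labels

    @classmethod
    def _compute_data_for_display(cls, scores, *, metric_x, metric_y):
        # Private constructor: `scores` maps an estimator name to a
        # {metric name: value} dict (an assumed input format).
        names = list(scores)
        return cls(
            x=[scores[name][metric_x] for name in names],
            y=[scores[name][metric_y] for name in names],
            labels=names,
        )


class ToyComparisonReport:
    """Hypothetical report; the only public entry point that builds a display."""

    def __init__(self, scores):
        self._scores = scores

    def pair_plot(self, metric_x, metric_y):
        return ToyPairPlotDisplay._compute_data_for_display(
            self._scores, metric_x=metric_x, metric_y=metric_y
        )


report = ToyComparisonReport(
    {
        "ridge": {"fit_time": 0.02, "accuracy": 0.81},
        "forest": {"fit_time": 0.35, "accuracy": 0.88},
    }
)
display = report.pair_plot("fit_time", "accuracy")
```

Users only ever call the report method; the leading underscore signals that the constructor on the display is not part of the public API.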

glemaitre · 2025-05-27T11:52:43Z

skore/src/skore/sklearn/_comparison/report.py

+        # - add kwargs (later)
+
+        return PairPlotDisplay.from_metrics(
+            metrics=self.metrics.report_metrics(


One thing I realized with the current implementation is that we will want to pass most of the parameters through to report_metrics.

This suggests that PairPlotDisplay is just one kind of plot associated with report_metrics. In short, I think it would make sense to be able to write:

report.metrics.report_metrics().plot(kind="pair", x="fit_time", y="accuracy")

but also

report.metrics.report_metrics().plot(kind="bar")

It also lets us pass the arguments as:

report.metrics.report_metrics(data_source="train", ...).plot(kind="pair")
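
One way such a `kind` dispatch could look, as a sketch only: `ToyMetricsDisplay` is a hypothetical class, the default `kind="bar"` is an arbitrary choice, and `plot` returns the data it would draw instead of calling a plotting library so that the example stays dependency-free.

```python
class ToyMetricsDisplay:
    """Hypothetical display returned by a report_metrics-like method."""

    def __init__(self, scores):
        # `scores` maps an estimator name to a {metric name: value} dict.
        self.scores = scores

    def plot(self, kind="bar", x=None, y=None):
        # Dispatch on `kind`; a real display would draw instead of returning data.
        if kind == "pair":
            if x is None or y is None:
                raise ValueError("kind='pair' requires both `x` and `y`.")
            return self._pair_data(x, y)
        if kind == "bar":
            return self._bar_data()
        raise ValueError(f"Unknown kind {kind!r}; expected 'pair' or 'bar'.")

    def _pair_data(self, x, y):
        # One (x, y) point per estimator.
        return {name: (m[x], m[y]) for name, m in self.scores.items()}

    def _bar_data(self):
        # One group per metric, one value per estimator.
        metrics = sorted({key for m in self.scores.values() for key in m})
        return {
            key: {name: m[key] for name, m in self.scores.items() if key in m}
            for key in metrics
        }


toy_display = ToyMetricsDisplay(
    {
        "ridge": {"fit_time": 0.02, "accuracy": 0.81},
        "forest": {"fit_time": 0.35, "accuracy": 0.88},
    }
)
points = toy_display.plot(kind="pair", x="fit_time", y="accuracy")
bars = toy_display.plot(kind="bar")
```

Because the display is built from the output of the metrics method, any argument accepted there (such as data_source) is handled before plotting and does not need to be repeated on `plot`.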

glemaitre · 2025-05-27T11:53:52Z

skore/src/skore/sklearn/_plot/metrics/pair_plot.py

+        self.figure_ = None
+        self.ax_ = None
+        self.text_ = None


For consistency with the other displays, these should be created only when plot is called. We can see in a subsequent PR whether we want to change this behaviour consistently and initialize them up front.

glemaitre · 2025-05-27T11:56:27Z

skore/src/skore/sklearn/_plot/metrics/pair_plot.py

+        x_label = _SCORE_OR_LOSS_INFO.get(perf_metric_x, {}).get("name", perf_metric_x)
+        y_label = _SCORE_OR_LOSS_INFO.get(perf_metric_y, {}).get("name", perf_metric_y)


I think those labels should be passed in directly by the report methods. That would be handy because the report side already has access to the _SCORE_OR_LOSS_INFO dictionary.
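
For reference, the chained lookup in the diff falls back to the raw metric name when a metric is unknown. The sketch below uses a made-up excerpt of the dictionary (the real `_SCORE_OR_LOSS_INFO` may hold other keys and values), and `axis_label` is a hypothetical helper.

```python
# Hypothetical excerpt; the real _SCORE_OR_LOSS_INFO may differ.
_SCORE_OR_LOSS_INFO = {
    "accuracy": {"name": "Accuracy"},
    "fit_time": {"name": "Fit time (s)"},
}


def axis_label(metric):
    """Return the display name for `metric`, or `metric` itself if unknown."""
    # The first .get returns {} for unknown metrics, so the second .get
    # falls through to the default, which is the metric name itself.
    return _SCORE_OR_LOSS_INFO.get(metric, {}).get("name", metric)


known_label = axis_label("accuracy")
unknown_label = axis_label("my_custom_metric")
```

Resolving the labels on the report side and passing plain strings to the display would keep the display free of any dependency on this dictionary.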

MarieSacksick · 2025-06-09T14:34:34Z

Closing - fresh start in #1816.

MarieSacksick added 3 commits April 4, 2025 16:03

docs: objective of the branch

aa20336

feat: start the function

881e2b6

add comment

aa16872

github-actions bot assigned MarieSacksick Apr 4, 2025

MarieSacksick changed the title ~~enh: Add timing plot in comparison report~~ feat: Add timing plot in comparison report Apr 4, 2025

save before we

987bb13

notes feedback from Guillaume

63b878e

add utils docstring

c8c5ddb

MarieSacksick force-pushed the timing_plot branch from 7b0ae03 to c8c5ddb Compare May 13, 2025 15:35

Merge branch 'main' into timing_plot

f8fee6d

MarieSacksick and others added 10 commits May 13, 2025 17:51

feat pairwise: handle missing pos label

1524d46

improve feat

653326f

Merge branch 'main' into timing_plot

b8866ac

turn into display

e288561

complete docstrings

43e83a8

add tests

86da68e

correct docstring and comments

fdc7dfb

bugfix

de68fd3

fix tests

37ddfd9

Merge branch 'main' into timing_plot

e71d9db

MarieSacksick marked this pull request as ready for review May 23, 2025 09:11

MarieSacksick requested a review from auguste-probabl May 23, 2025 09:11

auguste-probabl reviewed May 23, 2025

skore/src/skore/sklearn/_plot/metrics/pair_plot.py (outdated)

auguste-probabl reviewed May 23, 2025

skore/src/skore/sklearn/_comparison/report.py (outdated)

Update skore/src/skore/sklearn/_comparison/report.py

ae1fcb9

Co-authored-by: Auguste Baum <[email protected]>

remove traces from inspiration

cd083ac

MarieSacksick changed the title ~~feat: Add timing plot in comparison report~~ feat: Add pair plot in comparison report May 26, 2025

Merge branch 'main' into timing_plot

cc2f43c

glemaitre self-requested a review May 27, 2025 09:00

glemaitre reviewed May 27, 2025

MarieSacksick and others added 4 commits May 28, 2025 15:05

Update skore/src/skore/sklearn/_plot/metrics/pair_plot.py

72eb0dc

Co-authored-by: Guillaume Lemaitre <[email protected]>

Update skore/src/skore/sklearn/_plot/metrics/pair_plot.py

254648f

Co-authored-by: Guillaume Lemaitre <[email protected]>

Update skore/src/skore/sklearn/_plot/metrics/pair_plot.py

3294dc5

Co-authored-by: Guillaume Lemaitre <[email protected]>

linting

f482c5f

MarieSacksick marked this pull request as draft May 30, 2025 12:55

Merge branch 'main' into timing_plot

a63dd79

MarieSacksick changed the title ~~feat: Add pair plot in comparison report~~ feat: Turn report_metrics of ComparisonReport into Displays Jun 2, 2025

MarieSacksick closed this Jun 9, 2025

MarieSacksick deleted the timing_plot branch July 23, 2025 09:56
