
Use worst case method for MKI / KI #34

Merged
Damonamajor merged 42 commits into main from 33-investigate-decimal-place-stability on Sep 12, 2025

Conversation

@Damonamajor (Contributor) commented Sep 2, 2025

This PR uses a worst-case-scenario method for identifying MKI / KI values in order to ensure reproducibility.

@Damonamajor Damonamajor linked an issue Sep 2, 2025 that may be closed by this pull request
@Damonamajor Damonamajor changed the title 33 investigate decimal place stability Use worst case method for MKI / KI Sep 2, 2025
Comment thread assesspy/metrics.py
-df.sort_values(by="sale_price", kind="mergesort", inplace=True)
+df.sort_values(
+    by=["sale_price", "estimate"],
+    ascending=[True, False],
@Damonamajor (Contributor, Author):

Uses ascending=False for estimate in accordance with our external guidance.

From that guidance: "After a lot of deliberation, we decided the best way forward is to assume the 'worst case scenario' for the MKI/KI metrics by sorting the data first by ascending actual value (sale price) and then by descending predicted value (modeled result). Not saying this has to be your solution, but wanted to share our thinking if helpful."
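Concretely, that worst-case ordering corresponds to something like the following minimal sketch (toy data, not the exact code in metrics.py):

import pandas as pd

# Toy ratio-study frame; values are illustrative only.
df = pd.DataFrame(
    {"sale_price": [100, 100, 250], "estimate": [95, 120, 240]}
)

# Worst-case ordering: ascending actual value (sale_price), then
# descending predicted value (estimate) so ties break deterministically.
df = df.sort_values(by=["sale_price", "estimate"], ascending=[True, False])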

Comment thread assesspy/tests/test_metrics.py Outdated
@Damonamajor Damonamajor marked this pull request as ready for review September 2, 2025 21:11
@Damonamajor Damonamajor self-assigned this Sep 3, 2025
@jeancochrane (Member) left a comment

Great work here! Overall I think this looks right, just a few small nitpicks below related to the test definition.

Comment thread assesspy/tests/test_metrics.py Outdated
Comment thread assesspy/data/quintos_sample.csv Outdated
Comment thread assesspy/data/quintos_sample.csv
Comment thread assesspy/tests/test_metrics.py Outdated
Comment thread assesspy/tests/test_metrics.py Outdated
Comment on lines +85 to +91
@pt.mark.parametrize("metric", ["mki", "ki"])
def test_quintos_metric_matches_across_estimates(metric):
    """
    For the quintos dataset, MKI/KI should be identical based
    on the ordering of estimates.
    """
    sample = ap.quintos_sample()
@jeancochrane (Member):

[Nitpick, optional] For consistency with other tests, I think it would make sense to switch to using a fixture here. Note how the unchanged tests above include the quintos_data fixture via a function parameter -- this is technically a fixture definition that inherits from the quintos_data fixture, but the principle is the same as if we were including the fixture in a test:

@pt.fixture
def metric_val(self, metric, ccao_data, quintos_data):
    if metric in ["mki", "ki"]:
        return getattr(ap, metric)(*quintos_data)
    return getattr(ap, metric)(*ccao_data)

Here's the definition of the quintos_data fixture, which pytest loads automatically from conftest.py on startup so it can pass the fixture into any fixture or function that includes it in its function parameters:

@pt.fixture(scope="session")
def quintos_data() -> tuple:
    sample = ap.quintos_sample()
    return sample.estimate, sample.sale_price

If we follow my recommendation above to save the new data in a new sample file, we would need to define a new quintos_data_with_tiebreaks fixture in conftest.py and then include it here:

Suggested change

-@pt.mark.parametrize("metric", ["mki", "ki"])
-def test_quintos_metric_matches_across_estimates(metric):
-    """
-    For the quintos dataset, MKI/KI should be identical based
-    on the ordering of estimates.
-    """
-    sample = ap.quintos_sample()
+@pt.mark.parametrize("metric", ["mki", "ki"])
+def test_quintos_metric_matches_across_estimates(metric, quintos_data_with_tiebreaks):
+    """
+    For the quintos dataset, MKI/KI should be identical based
+    on the ordering of estimates.
+    """
+    sample = quintos_data_with_tiebreaks

The change should be pretty similar if we stick with quintos_data -- we just wouldn't need the extra fixture definition in conftest.py in that case, since the quintos_data fixture already exists.
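For reference, the new fixture could look something like this in conftest.py. This is a sketch only: it assumes the new sample loader is exposed as ap.quintos_sample_with_tiebreaks() and that the column names match the unpacking used in the test below.

@pt.fixture(scope="session")
def quintos_data_with_tiebreaks() -> tuple:
    # Hypothetical loader name mirroring the existing ap.quintos_sample();
    # it would point at the sample file that contains the tied values.
    sample = ap.quintos_sample_with_tiebreaks()
    return (
        sample.sale_price,
        sample.estimate,
        sample.estimate_alt_sort_1,
        sample.estimate_alt_sort_2,
    )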

Damonamajor and others added 7 commits September 3, 2025 11:09
Co-authored-by: Jean Cochrane <jeancochrane@users.noreply.github.com>
Co-authored-by: Jean Cochrane <jeancochrane@users.noreply.github.com>
def test_mki_tiebreaks_consistent(
    self, metric, quintos_data_with_tiebreaks
):
    sale_price, estimate, estimate_alt_sort_1, estimate_alt_sort_2 = (
@Damonamajor (Contributor, Author):

We could index into the tuple instead, but unpacking feels easier to interpret later.
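For context, a rough sketch of how the unpacked values might feed the assertions; the actual test body isn't shown in this thread, so the calls below are assumptions that mirror the getattr(ap, metric)(estimate, sale_price) pattern from the existing fixture:

def test_mki_tiebreaks_consistent(self, metric, quintos_data_with_tiebreaks):
    sale_price, estimate, estimate_alt_sort_1, estimate_alt_sort_2 = (
        quintos_data_with_tiebreaks
    )
    # With the worst-case tiebreak sort in place, reordering tied
    # estimates should not change the metric.
    base = getattr(ap, metric)(estimate, sale_price)
    assert base == getattr(ap, metric)(estimate_alt_sort_1, sale_price)
    assert base == getattr(ap, metric)(estimate_alt_sort_2, sale_price)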

@jeancochrane (Member) left a comment

The new version of the test/fixture looks great! Some suggestions below, mostly just tweaks to documentation to make the purpose of this change clearer.

Comment thread assesspy/load_data.py Outdated
Comment thread assesspy/load_data.py Outdated
Comment thread assesspy/metrics.py
df.sort_values(
    by=["sale_price", "estimate"],
    ascending=[True, False],
    kind="mergesort",
@jeancochrane (Member):

[Question, optional] I wonder if this kwarg is still necessary? Per the pandas docs, kind is only used when sorting on a single column, but now we're sorting on two columns:

Choice of sorting algorithm. See also numpy.sort() for more information. mergesort and stable are the only stable algorithms. For DataFrames, this option is only applied when sorting on a single column or label.

I'm agnostic as to whether we should leave the kwarg in or take it out -- it doesn't seem to make anything worse to leave it in, and it could provide a layer of defensiveness preventing us from accidentally reintroducing an unstable sort if we ever decide to switch back to sorting on a single column -- but I'd be interested to see if the tests still pass when we take it out.
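One quick way to sanity-check that outside the test suite, as a sketch with toy data (not the quintos sample): sort the same frame from two different starting row orders and compare.

import pandas as pd

df = pd.DataFrame(
    {"sale_price": [100, 100, 100, 250], "estimate": [90, 110, 110, 240]}
)

# With an explicit tiebreak column, the sorted result should match even
# without kind="mergesort", regardless of the input row order.
sorted_a = df.sort_values(by=["sale_price", "estimate"], ascending=[True, False])
sorted_b = df.sample(frac=1, random_state=0).sort_values(
    by=["sale_price", "estimate"], ascending=[True, False]
)
assert sorted_a.reset_index(drop=True).equals(sorted_b.reset_index(drop=True))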

@Damonamajor (Contributor, Author):

I assumed it wouldn't affect anything. The only reason I left it in is that if we ever wanted to use the sorted dataset externally, the row order would stay the same. For example, we might want to look at class once the data is sorted for MKI. That's not a great example, but I could imagine something along those lines.

I expect it to pass even without it.

@jeancochrane (Member):

I'm fine leaving it in!

Comment thread assesspy/metrics.py Outdated
Comment thread docs/source/quintos_sample_with_tiebreaks.rst Outdated
Damonamajor and others added 5 commits September 12, 2025 14:34
Co-authored-by: Jean Cochrane <jeancochrane@users.noreply.github.com>
Co-authored-by: Jean Cochrane <jeancochrane@users.noreply.github.com>
Co-authored-by: Jean Cochrane <jeancochrane@users.noreply.github.com>
Co-authored-by: Jean Cochrane <jeancochrane@users.noreply.github.com>
@Damonamajor Damonamajor merged commit 488cd1e into main Sep 12, 2025
8 checks passed
@Damonamajor Damonamajor deleted the 33-investigate-decimal-place-stability branch September 12, 2025 20:19


Development

Successfully merging this pull request may close these issues.

Ensure decimal place stability for MKI

3 participants