Update evaluation logic for dashboard support #62

prateekdesai04 · 2023-10-23T20:16:24Z

Description of changes:
This PR handles the case where if multiple cleaned CSVs having been run on different folds are being evaluated.
Initially evaluation was only possible if all were using same number of folds.
This sets the folds to the least of all the cleaned CSVs being evaluated.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Innixma · 2023-10-23T21:03:10Z

src/autogluon/bench/eval/evaluation/benchmark_evaluator.py

+        dataframes = []
+        for path in paths:
+            path = path if is_s3_url(path) else os.path.join(self.results_dir_input, path)
+            dataframe = pd.read_csv(path)
+            dataframes.append(dataframe)
+        # Discarding extra folds
+        min_num_rows = min(len(df) for df in dataframes)
+        trimmed_dataframes = [df[:min_num_rows] for df in dataframes]
+        return pd.concat(trimmed_dataframes, ignore_index=True, sort=True)


This will not discard extra folds properly. Please add a unit test and separate out the filtering logic so it is not hard-coded into the load_results_raw method.

Not all DataFrames loaded will have the same number of methods or datasets, so trimming by length of rows will not work.

We don't want to always filter extra folds. This should be a post-load operation that is optional.

You are assuming the input file is sorted by fold. This is not a valid assumption.

Innixma

Refer to above comment

suzhoum · 2023-10-23T21:04:23Z

src/autogluon/bench/eval/evaluation/benchmark_evaluator.py

+            dataframe = pd.read_csv(path)
+            dataframes.append(dataframe)
+        # Discarding extra folds
+        min_num_rows = min(len(df) for df in dataframes)


What if there are multiple datasets in results file? min() will not do what it's intended right?

Update eval logic

3c0d432

prateekdesai04 requested review from Innixma, suzhoum and tonyhoo October 23, 2023 20:45

tonyhoo approved these changes Oct 23, 2023

View reviewed changes

Innixma reviewed Oct 23, 2023

View reviewed changes

Innixma requested changes Oct 23, 2023

View reviewed changes

suzhoum reviewed Oct 23, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update evaluation logic for dashboard support #62

Update evaluation logic for dashboard support #62

Uh oh!

prateekdesai04 commented Oct 23, 2023 •

edited

Loading

Uh oh!

Innixma Oct 23, 2023 •

edited

Loading

Uh oh!

Innixma left a comment

Uh oh!

suzhoum Oct 23, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Update evaluation logic for dashboard support #62

Are you sure you want to change the base?

Update evaluation logic for dashboard support #62

Uh oh!

Conversation

prateekdesai04 commented Oct 23, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Innixma Oct 23, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Innixma left a comment

Choose a reason for hiding this comment

Uh oh!

suzhoum Oct 23, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

prateekdesai04 commented Oct 23, 2023 •

edited

Loading

Innixma Oct 23, 2023 •

edited

Loading