[ODSC-75899] : Add post-processing step for forecast clipping #1261

codeloop · 2025-09-07T10:15:56Z

Add post-processing step for forecast clipping

This change introduces a new post-processing step for the forecast operator that allows users to clip the forecast output to a specified minimum and maximum value. This is useful for ensuring that the forecast values remain within a realistic or desirable range.

This change includes:

A new PostprocessingSteps configuration to define the min/max clipping values.
Updates to the operator schema to expose the new configuration.
Implementation of the clipping logic using numpy.clip.
A new test case to verify the clipping functionality.

ahosler

The metrics need to be generated using the post-processing numbers. Right now they use the pre-processing numbers, which means the metrics are the same whether they post-process or not. Yet the report uses the post-processed numbers. That seems silly.

Do the clip, then generate all numbers and metrics.
And keep postprocessing in backtesting, that's important!

ahosler · 2025-09-18T17:05:02Z

ads/opctl/operator/lowcode/forecast/model/forecast_datasets.py

+            self.postprocessing.set_max_forecast,
+        )
+        if min_threshold is not None or max_threshold is not None:
+            np.clip(forecast_val, min_threshold, max_threshold, out=forecast_val)


Why use "out=forecast_val" instead of "forecast_val=..."?

ahosler · 2025-09-18T17:07:25Z

ads/opctl/operator/lowcode/forecast/model/prophet.py

            horizon=self.spec.horizon,
            target_column=self.original_target_column,
            dt_column=self.spec.datetime_column.name,
+            postprocessing=self.spec.postprocessing,


Do we need to pass this everywhere if it's already part of "self"?

ahosler · 2025-09-18T17:11:31Z

ads/opctl/operator/lowcode/forecast/model_evaluator.py

        backtest_spec["output_directory"] = {"url": output_file_path}
        backtest_spec["target_category_columns"] = [DataColumns.Series]
        backtest_spec["generate_explanations"] = False
+        backtest_spec.pop('postprocessing', None)


It's not clear to me why we'd want to pop this.
Don't we want to evaluate models based on which is giving the best end result? The post-processing is part of the end result.

I could imagine a case where the true values look like a sin function with a floor of 0. Backtesting may evaluate 2 options: a sin function and a const = sqrt(2).

Backtesting may favour the const function, even though the sin + post-processing would have a 0 MAPE.

I think we should keep postprocessing, but lets chat if you think otherwise.

ahosler · 2025-09-18T17:12:10Z

ads/opctl/operator/lowcode/forecast/model/forecast_datasets.py

    get_frequency_of_datetime,
 )

-from ..const import ForecastOutputColumns, SupportedModels, TROUBLESHOOTING_GUIDE


We need to standardize on 1 linter. It's hard to read these PRs with all this junk

ahosler · 2025-09-18T17:13:04Z

ads/opctl/operator/lowcode/forecast/model/forecast_datasets.py

+                        self,
+                        attr,
+                        pd.concat([val_self, val_other], ignore_index=True, axis=0),
+                    )


assuming this is just linter

ahosler · 2025-09-18T17:13:59Z

ads/opctl/operator/lowcode/forecast/operator_config.py

+            self.postprocessing
+            if self.postprocessing is not None
+            else PostprocessingSteps()
+        )


this is more lines than the simpler

if self.postprocessing is None:
self.postprocessing = PostprocessingSteps()

ahosler · 2025-09-18T17:15:30Z

ads/opctl/operator/lowcode/forecast/schema.yaml

+          type: integer
+          required: false
+          meta:
+            description: "This can be used to define the maximum forecast in the output."


How about
"Set a minimum value for the forecast" and "Set a maximum value for the forecast"

ahosler · 2025-09-18T17:18:52Z

ads/opctl/operator/lowcode/forecast/schema.yaml

+      type: dict
+      required: false
+      schema:
+        set_min_forecast:


how about just "min", "max"?
We don't use "set" anywhere ele and "forecast" is redundant

ahosler · 2025-09-18T17:26:26Z

ads/opctl/operator/lowcode/forecast/model/forecast_datasets.py

+            self.postprocessing.set_max_forecast,
+        )
+        if min_threshold is not None or max_threshold is not None:
+            np.clip(forecast_val, min_threshold, max_threshold, out=forecast_val)


Let's break this out into a separate method so it's easier to find in the future. But i love the simplicity of using clip here.

Add post-processing step for forecast clipping

4523a0d

oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Sep 7, 2025

Add default model selection when no valid metrics are calculated

2cc43eb

codeloop requested a review from prasankh September 8, 2025 02:42

codeloop marked this pull request as ready for review September 8, 2025 02:42

codeloop requested review from darenr, mayoor, mrDzurb, VipulMascarenhas, qiuosier and ahosler as code owners September 8, 2025 02:42

codeloop enabled auto-merge September 8, 2025 02:42

codeloop changed the title ~~Add post-processing step for forecast clipping~~ [ODSC-75899] : Add post-processing step for forecast clipping Sep 8, 2025

This comment was marked as resolved.

Sign in to view

codeloop and others added 2 commits September 9, 2025 11:31

pop postprocessing param from backtest.

eec5ff0

docs: Update forecast operator schema documentation

d592c52

codeloop requested a review from prasankh September 9, 2025 06:15

Merge branch 'main' into vikaspa/postproc_max

b225c8c

prasankh approved these changes Sep 16, 2025

View reviewed changes

prasankh and others added 2 commits September 16, 2025 14:17

Merge branch 'main' into vikaspa/postproc_max

6e3a6c0

Merge branch 'main' into vikaspa/postproc_max

4b0adb4

ahosler requested changes Sep 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ODSC-75899] : Add post-processing step for forecast clipping #1261

[ODSC-75899] : Add post-processing step for forecast clipping #1261

Uh oh!

codeloop commented Sep 7, 2025

Uh oh!

This comment was marked as resolved.

Uh oh!

ahosler left a comment

Uh oh!

ahosler Sep 18, 2025

Uh oh!

ahosler Sep 18, 2025

Uh oh!

ahosler Sep 18, 2025

Uh oh!

ahosler Sep 18, 2025

Uh oh!

ahosler Sep 18, 2025

Uh oh!

ahosler Sep 18, 2025

Uh oh!

ahosler Sep 18, 2025

Uh oh!

ahosler Sep 18, 2025

Uh oh!

ahosler Sep 18, 2025

Uh oh!

Uh oh!

[ODSC-75899] : Add post-processing step for forecast clipping #1261

Are you sure you want to change the base?

[ODSC-75899] : Add post-processing step for forecast clipping #1261

Uh oh!

Conversation

codeloop commented Sep 7, 2025

Uh oh!

This comment was marked as resolved.

Uh oh!

ahosler left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!