Fixes a loop in the GENERATE_SERIES function on boundary values. #8812

sim1984 · 2025-11-25T14:33:20Z

The GENERATE_SERIES function behaves incorrectly at boundary values for the BIGINT and INT128 types. A loop occurs with the BIGINT type, and an exception is thrown with the INT128 type. Examples of incorrect behavior:

select * from generate_series(-9223372036854775807, -9223372036854775809, -1) as x rows 10;

INPUT message field count: 0
 
OUTPUT message field count: 1
01: sqltype: 32752 INT128 scale: 0 subtype: 0 len: 16
  :  name: GENERATE_SERIES  alias: GENERATE_SERIES
  : table:   schema:   owner:
 
                              GENERATE_SERIES
=============================================
                         -9223372036854775807
                         -9223372036854775808
                         -9223372036854775809

(OK, expected)

select * from generate_series(-9223372036854775807, -9223372036854775808, -1) as x rows 10;

INPUT message field count: 0
 
OUTPUT message field count: 1
01: sqltype: 580 INT64 scale: 0 subtype: 0 len: 8
  :  name: GENERATE_SERIES  alias: GENERATE_SERIES
  : table:   schema:   owner:
 
      GENERATE_SERIES
=====================
 -9223372036854775807
 -9223372036854775808
  9223372036854775807 // <-- Error. 
  9223372036854775806
  9223372036854775805
  9223372036854775804
  9223372036854775803
  9223372036854775802
  9223372036854775801
  9223372036854775800
===============

For INT128 example:

select * from generate_series(bin_shl(cast(2 as int128),126)+1, bin_shl(cast(2 as int128),126), -1) as x rows 10;

INPUT message field count: 0

OUTPUT message field count: 1
01: sqltype: 32752 INT128 scale: 0 subtype: 0 len: 16
  :  name: GENERATE_SERIES  alias: GENERATE_SERIES
  : table:   schema:   owner:

                              GENERATE_SERIES
=============================================
     -170141183460469231731687303715884105727
Statement failed, SQLSTATE = 22003
arithmetic exception, numeric overflow, or string truncation
-Integer overflow.  The result of an integer operation caused the most significant bit of the result to carry.

src/jrd/recsrc/TableValueFunctionScan.cpp

asfernandes · 2025-11-26T00:23:41Z

src/jrd/recsrc/TableValueFunctionScan.cpp

-			result += step;
+			// Prevent freezing at boundary values
+			if (((step < 0) && (result == MIN_SINT64)) ||
+				((step > 0) && (result == MAX_SINT64)))


Is this logic correct for scaled numbers?

This logic doesn't work if step is different from 1 or -1. I'll rework this solution for a more general case.

Do these dynamic type switches needed? Why not count in output data type from the beginning?

Do these dynamic type switches needed? Why not count in output data type from the beginning?

The type is already chosen based on the most general of the three parameters. However, calculations with the INT128 type are handled differently than with INT64, so these are separate code paths.

Is this logic correct for scaled numbers?

This is a protection against internal type overflows. It's unlikely that you can specify scaled values at the function parameter level without causing an error sooner. At least these errors will be detected and won't simply lead to a loop.

Are there any significant advantages to this approach?

Not deal with different types every time.

The errors for me are correct, the step is applied and next the condition is checked. The overflow happens when applying the next step.

Although I admit that for a FOR RANGE LOOP, this error is perfectly logical. However, generate_seriers works a little differently. Such errors are definitely not expected there.

PostgreSQL example:

select * from generate_series(-9223372036854775807, -9223372036854775808, -1) as x;

|----------------------| | x | |----------------------| | -9223372036854775807 | | -9223372036854775808 |

select * from generate_series(-9223372036854775806, -9223372036854775807, -3) as x

|----------------------| | x | |----------------------| | -9223372036854775806 |

In my opinion, using ArithmeticNode::add to control computation boundaries is quite difficult. When a user uses the generate_series table function, they certainly don't think of it as a loop. And the goal of this PR was precisely to fix the loop/error issue where it shouldn't occur. A refactoring is possible in the future, if anyone deems it useful.

In my opinion, using ArithmeticNode::add to control computation boundaries is quite difficult. When a user uses the generate_series table function, they certainly don't think of it as a loop. And the goal of this PR was precisely to fix the loop/error issue where it shouldn't occur. A refactoring is possible in the future, if anyone deems it useful.

You have the option to enclose it in try-catch and swallow the error. It will do only in boundaries case.
Isn't it a good compromise to avoid extra verifications and custom implementations for different data types?

Okay, I'll try to do it this way.

@asfernandes

…now used as agreed upon by @asfernandes.

Fixes a loop in the GENERATE_SERIES function on boundary values.

a98863a

dyemanov reviewed Nov 25, 2025

View reviewed changes

Correction according to dyemanov

9c552d0

asfernandes reviewed Nov 26, 2025

View reviewed changes

sim1984 added 3 commits November 26, 2025 11:23

A more complete solution for taking into account boundary values

96f091d

Refactoring calculation with different types. ArithmeticNode::add is …

7996aef

…now used as agreed upon by @asfernandes.

Fixed non-ASCII character in variable name

f493687

sim1984 requested a review from asfernandes November 30, 2025 17:42

Uh oh!

Fixes a loop in the GENERATE_SERIES function on boundary values. #8812

Are you sure you want to change the base?

Fixes a loop in the GENERATE_SERIES function on boundary values. #8812

Conversation

sim1984 commented Nov 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sim1984 Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

sim1984 Nov 26, 2025 •

edited

Loading