Implement specified timeout for slow doctests #39746

user202729 · 2025-03-20T10:12:38Z

Fixes #39569 . Now a single doctest may add # long time (limit 100s) to set the time limit, if the actual time taken is below that then no warning will be raised.

I have added a few such comments as demonstration, but not all of them are added.

Also add some doctest and show the time taken on GitHub annotation, for convenience. (hopefully someone would look at it once the false positive/noise are dealt with…)

📝 Checklist

The title is concise and informative.
The description explains in detail what this PR is about.
I have linked a relevant issue or discussion.
I have created tests covering the changes.
I have updated the documentation and checked the documentation preview.

⌛ Dependencies

github-actions · 2025-03-20T14:07:52Z

Documentation preview for this PR (built with commit ae9f5a7; changes) is ready! 🎉
This preview will update shortly after each push to this PR.

user202729 · 2025-03-21T05:02:42Z

Note that https://doc.sagemath.org/html/en/developer/coding_basics.html#special-markup-to-influence-doctests requires # long time for tests that requires > 1 second to run, so someone would need to run --warn-long 1 and add the tag in a future pull request. Note that if it takes less than around 20 seconds then you should probably not use this feature, since the default time limit of # long time (without anything else in parentheses) is 30 seconds.

…marker

orlitzky · 2025-07-06T13:54:46Z

I just left a comment on #39569, but will summarize it here by saying that I think we should prefer to fix these tests rather than hide the (accurate) warnings.

According to our documentation, even long tests should complete in about 5s. If we have a test that takes 100s, it's a problem and should be dealt with. In #36226 I switched the runtime calculation to use CPU time for a more objective measure, and we lowered the warning threshold, though it is still far more lenient than the 1s and 5s recommendations in the developer guide. The whole point was to shine light upon the tests that are too slow, rather than me having to find them one at a time on my laptop when the doctests time out and create false positives. (The PR was started before we moved to Github, but now the CI has the same problem with random timeouts.)

Adding the framework for and exceptions to all of these tests will just revert us back to where we were -- with no warnings for tests that are in gross violation of our policy -- while requiring more code to do it. If there really are tests that require something like 100s to complete and there's no faster way to exercise the same code paths, then some other solution is called for (pre-release tests)?

user202729 · 2025-07-06T17:05:40Z

In theory, that's obviously the right thing to do.

In practice, suppose that it takes one year to clean up all these pull requests. Within that time, let's say we have to collectively review 1000 pull requests. The overhead of repeatedly looking at 1000 × 40 warnings can easily leads to people not looking at the warnings newly-introduced by the pull request and make the problem worse.

Adding # long time (limit Xs, issue 12345) doesn't mean it isn't an issue. It just mean that it is a known bug and we don't want to see the same thing for the upcoming pull requests.

(The alternative is you volunteer to quickly fix them, then yes, problem solved)

For comparison, there are 51 # known bug in the code base at the moment (and 5000+ issues in the repository). Obviously someone ought to fix them, but we don't want to see 51 failures × 6 platforms every pull request either.

orlitzky · 2025-07-06T19:02:16Z

For comparison, there are 51 # known bug in the code base at the moment (and 5000+ issues in the repository). Obviously someone ought to fix them, but we don't want to see 51 failures × 6 platforms every pull request either.

The problem with this analogy is that # known bug makes the test failures go away but # long time (limit Xs) does not. The tests still get run, still take forever, still cause timeouts, and still cause the test suite to fail -- a bigger problem than having to look at warnings.

40 is not a huge number. I was regularly fixing them, but it was futile before the CPU time branch was merged because if you make it possible to ignore the warnings, people ignore the warnings. Most slow tests were added because the author had a fast CPU, was testing on an unloaded system, and simply didn't realize it was slow. In those cases a smaller n, or a simpler field, or... can be used to speed up the test.

Many random tests are slow -- I was just looking at one of these in sage/geometry/cone.py that I am guilty of adding myself. The test is needed to exercise a particular code path, but the code doesn't actually need to be random. I can find a seed that happens to trigger the desired code path while terminating quickly, and then set_random_seed() before the test. I can't promise that every test will be easy to fix or that I'll understand the maths necessary to do it, but in my experience it is much much harder to find reviewers for the PRs than it is to fix the tests.

user202729 · 2025-07-06T21:59:33Z

40 is not a huge number

sure, nice.

I can't promise that every test will be easy to fix or that I'll understand the maths necessary to do it, but in my experience it is much much harder to find reviewers for the PRs than it is to fix the tests.

sounds like we have a problem... still, I suppose you can fix the test first and we see what to do later. Worst case CI fix is an option.

Most slow tests were added because the author had a fast CPU, was testing on an unloaded system, and simply didn't realize it was slow.

isn't this quite trivial i.e. the ratio of speed of any two CPU in existence at a given time is Θ(1)?

before the CPU time branch was merged

what's the relation here? or you mean the pull request also has the extra feature of raising the warning (?)

if you make it possible to ignore the warnings, people ignore the warnings

we're agreeing on this (the current situation is that there are too many warnings, which leads to people ignore them)

tobiasdiez · 2025-07-06T23:15:23Z

What about simply not adding the github annotation for the known 40 too long tests and tracking those instead in a new issue? Would prevent people from adding new too long tests without annoying anyone of too many unrelated warnings.

user202729 · 2025-07-07T00:55:16Z

What about simply not adding the github annotation for the known 40 too long tests and tracking those instead in a new issue? Would prevent people from adding new too long tests without annoying anyone of too many unrelated warnings.

this is exactly the same as # long time (limit Xs), no?

tobiasdiez · 2025-07-07T08:22:14Z

Not quite, since this long time with specified time is quite easy to add - and it's not clear to an average dev that it should not be used. On the other hand, a hard coded list somewhere in the doctester with a big warning header is less obvious and provides a better education.

Alternatively, we could also just tag these tests as "known bug".

user202729 · 2025-07-07T08:42:16Z

temporarily replace all tests that take too long with # known bug (see :issue:39569) (disadvantage: their correctness is then no longer tested)

@DaveWitteMorris complains about this option in #39569 .

a hard coded list somewhere in the doctester

like the baseline failure json? The disadvantage of having the list far from code is that when the code is fixed, nobody remembers to remove the entry from the list.

If it's just for educational purpose, we can either

add a
- I have not introduced any more # long time (limit Xs) in this pull request
to pull request template, or
change the wording ("limit Xs") to something more dangerous. (it's just a regex)

tobiasdiez · 2025-07-07T09:37:12Z

a hard coded list somewhere in the doctester

like the baseline failure json? The disadvantage of having the list far from code is that when the code is fixed, nobody remembers to remove the entry from the list.

Yes, but perhaps just a hard-coded array in the doctester would be sufficient in this case. I don't think it would be that bad in this case if the entry is not immediately removed from the list.

I share the sentiment that these warnings are a bit annoying and thus people just ignore them. But I don't have a strong opinion about the best way forward.

…marker

orlitzky · 2025-07-07T11:29:50Z

Can we at least eliminate the low-hanging fruit first? If a random test is slow, # long time (limit Xs) is going to be hard to get right, because the Xs is random. We have a few examples where foo.TestSuite() is slow, but that's an obvious candidate for pytest rather than a doctest because it doesn't demonstrate anything useful.

In fact, now that I type it, moving any tests that are hard to fix into pytest would be a nice interim solution. It eliminates the warnings, sidesteps the timeout issues, and we could add comments like "if you want this back in the documentation you have to speed it up first." But it's not even clear yet which ones would be hard to fix.

…marker

user202729 · 2025-08-02T23:13:11Z

easier said than done though (looks like most of the low-hanging fruits are addressed now). Sometimes there's just a single function taking no parameter whatsoever (so you cannot reduce it), and is still slow

for example this one:

G = graphs.shortened_000_111_extended_binary_Golay_code_graph()   # 25 s

(in reality it sometimes takes a little more than 30s)

it's an example, not a test, so you can't just "move to pytest" either. cf. #40443

…marker

orlitzky · 2025-08-03T22:49:03Z

easier said than done though (looks like most of the low-hanging fruits are addressed now). Sometimes there's just a single function taking no parameter whatsoever (so you cannot reduce it), and is still slow

for example this one:
G = graphs.shortened_000_111_extended_binary_Golay_code_graph()   # 25 s
(in reality it sometimes takes a little more than 30s)

it's an example, not a test, so you can't just "move to pytest" either. cf. #40443

This function is a constant. I can construct the same graph in much less time:

sage: %timeit -c H = Graph([G.vertices(), G.edges()], format="vertices_and_edges")
1.02 s ± 1.46 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

(and can similarly speed up most functions that take no arguments). We can pickle the vertices/edges as python ints, lists, and tuples -- and then load them if the user wants to construct this graph. For a test (suitable for pytest) we could then verify that the algorithm produces a graph isomorphic to the pickled one. This leaves the doctest example untouched, but running 25x faster.

sagemathgh-40558: Add long time marker to several slow tests This gets rid of about half of the warnings, until someone figure out whether they're intended to be slow, or how they can be sped up. Reference: sagemath#39569, sagemath#39746 ### 📝 Checklist  - [ ] The title is concise and informative. - [ ] The description explains in detail what this PR is about. - [ ] I have linked a relevant issue or discussion. - [ ] I have created tests covering the changes. - [ ] I have updated the documentation and checked the documentation preview. ### ⌛ Dependencies    URL: sagemath#40558 Reported by: user202729 Reviewer(s): Michael Orlitzky, user202729

github-actions bot added the s: needs review label Mar 20, 2025

user202729 force-pushed the long-time-extra-marker branch from e76243e to c10ad41 Compare March 20, 2025 10:34

Implement specified timeout for slow doctests

9958d74

user202729 force-pushed the long-time-extra-marker branch from c10ad41 to f7e0cc0 Compare March 20, 2025 12:18

user202729 added 2 commits March 21, 2025 12:03

Add some long time markers

8baef0b

Show time taken in GitHub annotation

ea574a8

user202729 force-pushed the long-time-extra-marker branch from ee7707a to ea574a8 Compare March 21, 2025 05:03

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

8a9c661

…marker

user202729 marked this pull request as draft March 27, 2025 17:12

github-actions bot removed the s: needs review label Mar 27, 2025

user202729 added 8 commits April 19, 2025 13:14

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

a030707

…marker

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

dd0d5da

…marker

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

51f4395

…marker

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

b904bff

…marker

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

3361d12

…marker

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

a47d317

…marker

Change time limit format

218a859

Add some more long time markers

994d028

user202729 marked this pull request as ready for review July 5, 2025 11:21

github-actions bot added the s: needs review label Jul 5, 2025

user202729 requested a review from tobiasdiez July 5, 2025 12:06

tobiasdiez requested a review from orlitzky July 6, 2025 12:33

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

5d616d8

…marker

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

3cf3a2b

…marker

user202729 mentioned this pull request Jul 28, 2025

Show long time warnings as GitHub annotations #40413

Merged

5 tasks

Merge remote-tracking branch 'upstream/develop' into long-time-extra-…

ae9f5a7

…marker

user202729 mentioned this pull request Aug 9, 2025

Add long time marker to several slow tests #40558

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Implement specified timeout for slow doctests #39746

Implement specified timeout for slow doctests #39746

Uh oh!

user202729 commented Mar 20, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Mar 20, 2025 •

edited

Loading

Uh oh!

user202729 commented Mar 21, 2025

Uh oh!

orlitzky commented Jul 6, 2025

Uh oh!

user202729 commented Jul 6, 2025 •

edited

Loading

Uh oh!

orlitzky commented Jul 6, 2025

Uh oh!

user202729 commented Jul 6, 2025

Uh oh!

tobiasdiez commented Jul 6, 2025

Uh oh!

user202729 commented Jul 7, 2025

Uh oh!

tobiasdiez commented Jul 7, 2025

Uh oh!

user202729 commented Jul 7, 2025 •

edited

Loading

Uh oh!

tobiasdiez commented Jul 7, 2025

Uh oh!

orlitzky commented Jul 7, 2025

Uh oh!

user202729 commented Aug 2, 2025 •

edited

Loading

Uh oh!

orlitzky commented Aug 3, 2025

Uh oh!

Uh oh!

Uh oh!

Implement specified timeout for slow doctests #39746

Are you sure you want to change the base?

Implement specified timeout for slow doctests #39746

Uh oh!

Conversation

user202729 commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📝 Checklist

⌛ Dependencies

Uh oh!

github-actions bot commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

user202729 commented Mar 21, 2025

Uh oh!

orlitzky commented Jul 6, 2025

Uh oh!

user202729 commented Jul 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

orlitzky commented Jul 6, 2025

Uh oh!

user202729 commented Jul 6, 2025

Uh oh!

tobiasdiez commented Jul 6, 2025

Uh oh!

user202729 commented Jul 7, 2025

Uh oh!

tobiasdiez commented Jul 7, 2025

Uh oh!

user202729 commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tobiasdiez commented Jul 7, 2025

Uh oh!

orlitzky commented Jul 7, 2025

Uh oh!

user202729 commented Aug 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

orlitzky commented Aug 3, 2025

Uh oh!

Uh oh!

user202729 commented Mar 20, 2025 •

edited

Loading

github-actions bot commented Mar 20, 2025 •

edited

Loading

user202729 commented Jul 6, 2025 •

edited

Loading

user202729 commented Jul 7, 2025 •

edited

Loading

user202729 commented Aug 2, 2025 •

edited

Loading