Improve memory load for map_complete_blocks #6730

stephenworsley · 2025-09-29T21:11:22Z

🚀 Pull Request

Closes #3808.

CLAassistant · 2025-09-29T21:11:29Z

All committers have signed the CLA.

codecov · 2025-10-07T08:52:20Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 90.25%. Comparing base (fa6d61d) to head (bb60e9e).
⚠️ Report is 2 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #6730      +/-   ##
==========================================
- Coverage   90.29%   90.25%   -0.05%     
==========================================
  Files          91       91              
  Lines       24475    24631     +156     
  Branches     4571     4609      +38     
==========================================
+ Hits        22100    22230     +130     
- Misses       1607     1624      +17     
- Partials      768      777       +9


pp-mo

Looks good!
Made some very minor suggestions for clarity.

pp-mo · 2025-10-22T18:46:45Z

lib/iris/_lazy_data.py

+        df = [False] * len(max_outchunks)
+        for dim in dims:
+            df[dim] = True
+        df = tuple(df)


I think this can be done more tidily

Suggested change

-        df = [False] * len(max_outchunks)
-        for dim in dims:
-            df[dim] = True
-        df = tuple(df)
+        df = tuple(i in dims for i in range(len(shape)))
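
To illustrate the suggestion, here is a minimal sketch comparing the two ways of building the boolean flag tuple. The function names and the `ndim` parameter are hypothetical, introduced only for this illustration; they are not part of `lib/iris/_lazy_data.py`. One caveat: the two forms agree for non-negative `dims`, but the loop form also accepts negative indices, while the membership test does not.

```python
def flags_with_loop(dims, ndim):
    """Build the flags by mutating a list, as in the original diff."""
    flags = [False] * ndim
    for dim in dims:
        flags[dim] = True
    return tuple(flags)


def flags_with_generator(dims, ndim):
    """Build the flags in one expression, as in the suggested change."""
    return tuple(i in dims for i in range(ndim))


# The two agree when every entry of dims is a non-negative index.
print(flags_with_loop((2, 3), 4))       # (False, False, True, True)
print(flags_with_generator((2, 3), 4))  # (False, False, True, True)

# They differ for negative indices: list indexing wraps around, `in` does not.
print(flags_with_loop((-1,), 4))        # (False, False, False, True)
print(flags_with_generator((-1,), 4))   # (False, False, False, False)
```

If callers may pass negative dimension indices, they would need normalising before the one-line form is a drop-in replacement.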

pp-mo · 2025-10-22T22:55:04Z

lib/iris/_lazy_data.py

+    -----
+    .. note:
+
+        If the output chunks would larger than the maximum chunksize set


Tiny typo

Suggested change

-        If the output chunks would larger than the maximum chunksize set
+        If the output chunks would be larger than the maximum chunksize set

pp-mo · 2025-10-22T23:28:12Z

lib/iris/tests/unit/lazy_data/test_map_complete_blocks.py

+            # Note that one chunk is irregularly rechunked and the other isn't.
+            expected_chunk = (2, 2, 1, 2, 2)
+            assert result.chunks[1] == expected_chunk


I think this would be clearer if you checked the chunks in both of the rechunked dims.

Suggested change

-            # Note that one chunk is irregularly rechunked and the other isn't.
-            expected_chunk = (2, 2, 1, 2, 2)
-            assert result.chunks[1] == expected_chunk
+            # Note that one chunk is irregularly rechunked and the other isn't.
+            assert result.chunks[0] == (1, 1, 1, 1, 1)
+            assert result.chunks[1] == (2, 2, 1, 2, 2)  # split from the original chunks of (5, 4)
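
The note "split from the original chunks of (5, 4)" can be checked with a little arithmetic. The sketch below is not the Iris or dask implementation: `split_chunks` and `max_size` are hypothetical names, and it assumes each original chunk is cut independently into pieces no larger than a given size, which is consistent with the values in the test. The starting chunks of `(2, 2, 1)` for the first dimension are likewise an assumption, taken from a length-5 dimension chunked in twos.

```python
def split_chunks(chunks, max_size):
    """Split each chunk into pieces of at most max_size, keeping chunk boundaries."""
    pieces = []
    for chunk in chunks:
        full, remainder = divmod(chunk, max_size)
        pieces.extend([max_size] * full)
        if remainder:
            pieces.append(remainder)
    return tuple(pieces)


# A length-9 dimension chunked as (5, 4), limited to pieces of size 2:
# the 5 becomes 2 + 2 + 1 and the 4 becomes 2 + 2, hence the irregular result.
print(split_chunks((5, 4), 2))     # (2, 2, 1, 2, 2)

# A length-5 dimension chunked as (2, 2, 1), limited to pieces of size 1.
print(split_chunks((2, 2, 1), 1))  # (1, 1, 1, 1, 1)
```

Because the original boundaries are preserved, the leftover `1` lands in the middle of the tuple rather than at the end, which is why the comment describes one dimension as irregularly rechunked.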

pp-mo · 2025-10-22T23:29:12Z

lib/iris/tests/unit/lazy_data/test_map_complete_blocks.py

+
+        result = map_complete_blocks(
+            cube, self.func, dims=(2, 3), out_sizes=(30, 40), dtype=lazy_array.dtype
+        )
+        assert is_lazy_data(result)


I don't think this bit is contributing anything much, as we don't check any properties of the result, except that it is lazy.
I think we can remove this, and only check the main result (i.e. the rechunked one).

Suggested change

-        result = map_complete_blocks(
-            cube, self.func, dims=(2, 3), out_sizes=(30, 40), dtype=lazy_array.dtype
-        )
-        assert is_lazy_data(result)
+        #(nothing)

pp-mo · 2025-10-22T23:32:07Z

lib/iris/tests/unit/lazy_data/test_map_complete_blocks.py

+        # Reduce the optimum dask chunksize.
+        with dask.config.set({"array.chunk-size": "32KiB"}):
+            result = map_complete_blocks(
+                cube, self.func, dims=(2, 3), out_sizes=(30, 40), dtype=lazy_array.dtype


I think it would be a more convincing example if the "fixed" dims weren't at the end of the shape.

I think you can easily transpose the example, so that the initial content changes from
da.ones((5, 9, 10, 10), chunks=(2, 5, 10, 5))
==> da.ones((5, 10, 9, 10), chunks=(2, 10, 5, 5))
and the map call here then uses dims=(1, 3).

(N.B. I did try this, and it does actually seem to work!)
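
As a rough check of the proposed transposition, the sketch below works out the per-dimension chunks for both layouts in plain Python, without dask. `expand_chunks` is a hypothetical helper that assumes uniform chunk sizes with a smaller final chunk where the size does not divide evenly; the axis order `(0, 2, 1, 3)` is my reading of the swap described in the comment.

```python
def expand_chunks(shape, chunk_sizes):
    """Expand one chunk size per dimension into explicit per-dimension chunk tuples."""
    chunks = []
    for size, chunk in zip(shape, chunk_sizes):
        full, remainder = divmod(size, chunk)
        chunks.append((chunk,) * full + ((remainder,) if remainder else ()))
    return tuple(chunks)


original = expand_chunks((5, 9, 10, 10), (2, 5, 10, 5))
transposed = expand_chunks((5, 10, 9, 10), (2, 10, 5, 5))

# Swapping axes 1 and 2 of the original layout gives the transposed layout.
axes = (0, 2, 1, 3)
permuted = tuple(original[axis] for axis in axes)

# The dims passed to the map call move with the axes: (2, 3) becomes (1, 3).
new_dims = tuple(axes.index(dim) for dim in (2, 3))

print(original)                # ((2, 2, 1), (5, 4), (10,), (5, 5))
print(transposed)              # ((2, 2, 1), (10,), (5, 4), (5, 5))
print(permuted == transposed)  # True
print(new_dims)                # (1, 3)
```

With this layout the dims being mapped over are no longer the trailing ones, which is the point of the suggestion.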

improve memory for map_complete_blocks

198f6ca

[pre-commit.ci] auto fixes from pre-commit.com hooks

acd68d6

for more information, see https://pre-commit.ci

scitools-ci bot added this to 🚴 Peloton Sep 30, 2025

stephenworsley added 2 commits October 7, 2025 09:38

fix test failures

bf4548a

Merge remote-tracking branch 'origin/map_complete_memory' into map_co…

f23a711

…mplete_memory # Conflicts: # lib/iris/_lazy_data.py

stephenworsley added 3 commits October 16, 2025 21:53

add test

a1fd8a1

Merge remote-tracking branch 'upstream/main' into map_complete_memory

501be7c

add whatsnew

fdbfaab

stephenworsley marked this pull request as ready for review October 16, 2025 21:05

pre-commit-ci bot and others added 3 commits October 16, 2025 21:06

[pre-commit.ci] auto fixes from pre-commit.com hooks

7467fc2

for more information, see https://pre-commit.ci

remove import

8fab000

Merge remote-tracking branch 'origin/map_complete_memory' into map_co…

bb60e9e

…mplete_memory

pp-mo requested changes Oct 22, 2025

3 participants
