perf: add specializations for mapreduce min/max #48
Conversation
Codecov Report: ✅ All modified and coverable lines are covered by tests.

```
@@            Coverage Diff             @@
##             main      #48      +/-   ##
==========================================
+ Coverage   96.20%   96.21%   +0.01%
==========================================
  Files           8        8
  Lines         527      529       +2
==========================================
+ Hits          507      509       +2
  Misses         20       20
```
Thanks @pfackeldey! Could you add this specialization for …? Also, should we add specializations for …?
It would be useful to have a table at the top of the documentation to show how the types are related to each other. Is this type disjoint from …?
Yeah, seems useful...
Yes, good idea!
We don't have a type …
Co-authored-by: Oliver Schulz <[email protected]>
```julia
Base.mapreduce(::typeof(maximum), ::typeof(max), A::ArrayOfSimilarArrays; kw...) = maximum(flatview(A); kw...)
Base.mapreduce(::typeof(minimum), ::typeof(min), A::ArrayOfSimilarArrays; kw...) = minimum(flatview(A); kw...)
```
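For reference, a minimal sketch of what these methods buy (the exact construction via `nestedview` is illustrative; `nestedview`/`flatview` are ArraysOfArrays.jl helpers):

```julia
using ArraysOfArrays

V = nestedview(rand(3, 100), 1)  # a vector of 100 similar 3-element vectors

# With the specialization, this dispatches straight to a single pass over the flat data:
mapreduce(maximum, max, V) == maximum(flatview(V))  # true
```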
Does it make sense to forward the kwargs to `minimum` here? The kwargs of `mapreduce` and `minimum` are not compatible, right?
Yeah, technically they don't completely overlap. However, the problem is we need `init`, because otherwise there's no way to make reducing over empty arrays work.
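For illustration, this is the stock Julia behavior (exact error text varies by Julia version):

```julia
julia> minimum(Float64[])
ERROR: ArgumentError: reducing over an empty collection is not allowed

julia> minimum(Float64[]; init = Inf)
Inf
```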
Would you in this case rather curry `minimum` to a `minimum(...; init=0)` and put this in `mapreduce`? And that hopefully still works because `typeof(minimum)` would still trigger on the curried function?
I don't know what this would look like in Julia, but in Python it would be something like the following pseudo-code:

```python
from functools import partial

minimum_with_init = partial(minimum, init=0)
mapreduce(minimum_with_init, min, A)
```

I'm not sure how multiple dispatch works with curried functions, and not sure if this is a common solution/paradigm in Julia.
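For what it's worth, dispatch on `::typeof(minimum)` would not catch a curried version in Julia, since a closure gets its own type:

```julia
minimum_with_init = x -> minimum(x; init = 0)  # hypothetical curried form

typeof(minimum_with_init) === typeof(minimum)  # false: the closure has its own type,
                                               # so a ::typeof(minimum) method won't match it
```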
I think it might be best to declare and handle the kwargs explicitly.
As for `minimum` with `init`, I doubt many users would write their code like that ...
> I think it might be best to declare and handle the kwargs explicitly.

There's no good default for `init=`, so if the user doesn't provide one, we also don't.
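One possible shape for that (a sketch with a hypothetical helper name, not the PR's actual code):

```julia
# Forward `init` to the aggregation only when the caller supplied one:
_fwd_min(A; init = nothing) =
    init === nothing ? minimum(flatview(A)) : minimum(flatview(A); init = init)
```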
done in ffe6030
On generic arrays the default implementation of `mapreduce` is

```julia
mapreduce(f, op, A::AbstractArrayOrBroadcasted; dims=:, init=_InitialValue()) =
    _mapreduce_dim(f, op, init, A, dims)
```

but on GPU arrays it is

```julia
Base.mapreduce(f, op, A::AnyGPUArray, As::AbstractArrayOrBroadcasted...; dims=:, init=nothing)
```

So handling of the `init` default differs, and `_InitialValue` is not public. It would be nice if there were an official "default init" construct, but there isn't, so we have to get a bit creative.

Also, we can't just pass a `dims` kwarg that is not `:` through to the aggregation function on the flattened collection; that would produce incorrect results in many cases.
I suggest we do something like this (untested!):
```julia
using Base: add_sum, mul_prod

for (f, op) in [
    (:sum, :add_sum), (:prod, :mul_prod), (:maximum, :max), (:minimum, :min)
]
    for AoA in [:AbstractArrayOfSimilarArrays, :AbstractVectorOfArrays]
        @eval begin
            Base.mapreduce(::typeof($f), ::typeof($op), A::$AoA; dims = :, init = nothing) =
                _aoa_associative_mapreduce($f, $op, A, dims, init)
        end
    end
end

# `dims == :`: aggregate directly over the flattened data, forwarding `init` only if given:
_aoa_associative_mapreduce(f, ::Any, A, ::Colon, ::Nothing) = f(flatview(A))
_aoa_associative_mapreduce(f, ::Any, A, ::Colon, init) = f(flatview(A); init = init)

# `dims != :`: wrap `f` so that `typeof(f)` no longer matches and we dispatch to the
# generic mapreduce implementation instead of recursing into the methods above:
_aoa_associative_mapreduce(f, op, A, dims, ::Nothing) = mapreduce(x -> f(x), op, A; dims = dims)
_aoa_associative_mapreduce(f, op, A, dims, init) = mapreduce(x -> f(x), op, A; dims = dims, init = init)
```
@oschulz I think we're gonna give up on making this PR now 😂
Well, we have to do it somewhat cleanly, esp. the `dims` handling.
There's no `dims` to handle if we're just specializing `VectorOfX`, because a vector is 1D. Let's do that first, I guess (see the sketch below).
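A minimal sketch of that vector-only route (assuming the package's `VectorOfSimilarArrays`; untested, and `init` handling omitted):

```julia
# No `dims` ambiguity for a 1-D container of arrays:
Base.mapreduce(::typeof(maximum), ::typeof(max), V::VectorOfSimilarArrays) =
    maximum(flatview(V))
Base.mapreduce(::typeof(minimum), ::typeof(min), V::VectorOfSimilarArrays) =
    minimum(flatview(V))
```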
This PR adds specializations for `mapreduce(maximum, max, V)` and `mapreduce(minimum, min, V)` to dispatch directly to `maximum(V.data)` and `minimum(V.data)`, thus avoiding allocating intermediates.
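A quick way to sanity-check the effect (a hypothetical benchmark, not from the PR; numbers will vary by machine and Julia version):

```julia
using ArraysOfArrays, BenchmarkTools

V = nestedview(rand(128, 10_000), 1)

@btime mapreduce(maximum, max, $V)  # generic path: iterates over inner views
@btime maximum(flatview($V))        # specialized path: single pass over the flat data
```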