GH-32007 [Python] Support arithmetic on arrays and scalars #48085

rmnskb · 2025-11-08T00:59:43Z

Rationale for this change

Please see #32007, currently, neither arrays nor scalars support Python-native arithmetic operations, such as array + array, it has to be done via pyarrow.compute API. This PR strives to fix this with custom dunder methods.

What changes are included in this PR?

Implemented dunder methods

Are these changes tested?

Yes

Are there any user-facing changes?

Possibility to use Python operators directly instead of calling the pyarrow.compute API.

GitHub Issue: [Python] Support arithmetic on arrays and scalars #32007

rmnskb · 2025-11-08T01:08:52Z

@pitrou @AlenkaF @rok
Sorry for pinging like this, I wanted to understand the scope correctly. So far, I went with the most straightforward implementation and added dunders for all operations that were listed in the compute docs and had respective dunders available. Do you think this implementation is enough or should I go for other dunders as well, e.g bitwise operations? After reviewing the original discussion, I've seen @jorisvandenbossche suggesting checked versions of the functions, I think I will go with them as well.
Regarding the documentation:

Should I add docstrings to dunder methods? Given that user won't see them in their IDE, for example.
Should the documentation for arrays/scalars be updated to reflect that the native operators are now supported?
Thanks for your input!

rok · 2025-11-11T17:46:54Z

Sorry for late reply @rmnskb. I think all dunders that semantically map to our existing kernels are fair game! I'm not sure what these would be, but the set you have now looks decently sized.

Should I add docstrings to dunder methods? Given that user won't see them in their IDE, for example.

We might be able to add docstrings via type annotations. Or perhaps it's possible to copy them at runtime from the respective kernels? (unsure that is possible). In any case we probably don't want to duplicate docstrings in general.

Should the documentation for arrays/scalars be updated to reflect that the native operators are now supported?

Yes, that would be good to have!

rmnskb · 2025-11-11T18:37:42Z

Sorry for late reply @rmnskb. I think all dunders that semantically map to our existing kernels are fair game! I'm not sure what these would be, but the set you have now looks decently sized.

Ok, I was also thinking about covering as many dunder methods as possible, but did not want to go out of scope for this issue.

We might be able to add docstrings via type annotations. Or perhaps it's possible to copy them at runtime from the respective kernels? (unsure that is possible). In any case we probably don't want to duplicate docstrings in general.

Copying the docstrings sounds like a good idea, I will look further into that.

AlenkaF · 2025-11-12T08:17:37Z

Thank you for working on this @rmnskb and sorry for replying late!
Pinging us when you have a question is totally fine and in fact necessary ;)

I think the current scope is solid. I am thinking of basic comparisons and the discussion in the issue around __eq__. That would also be good to tackle but maybe as a separate PR?

Yes, it would be important to update the documentation (User Guide which is the part that is not API reference related).

One comment after looking at the code would be to add tests that cover various type combinations (array, scalar, Python types, unsupported types ...)

rok · 2025-11-12T10:33:50Z

I think the current scope is solid. I am thinking of basic comparisons and the discussion in the issue around __eq__. That would also be good to tackle but maybe as a separate PR?

I think we already have __eq__: https://github.com/apache/arrow/pull/7737/files.

One comment after looking at the code would be to add tests that cover various type combinations (array, scalar, Python types, unsupported types ...)

That's a good point, __eq__ currently raises if the types of compared don't match. Good test coverage would also help find edge cases.

    def __eq__(self, other):
        try:
            return self.equals(other)
        except TypeError:
            # This also handles comparing with None
            # as Array.equals(None) raises a TypeError.
            return NotImplemented

AlenkaF · 2025-11-12T11:26:11Z

I think we already have __eq__: https://github.com/apache/arrow/pull/7737/files.

Yes, true. I was under the impression that we might want to change the use of equals() method with the equal compute function? From the issue:

I think if we add those, we should also change the behaviour of __eq__, although that is something that will require a long deprecation cycle.

rok · 2025-11-12T12:25:51Z

Oh, I managed to forget this and thought array.equals dispatches to compute.equals already 🤦.

It seems like a good opportunity to change this! :)

rmnskb · 2025-11-13T21:17:14Z

We might be able to add docstrings via type annotations. Or perhaps it's possible to copy them at runtime from the respective kernels? (unsure that is possible). In any case we probably don't want to duplicate docstrings in general.

I've looked more into this, and based on these two discussions, we'd have to copy them directly from the underlying functions at runtime via importing the pyarrow.compute module, which AFAIK we don't have access to, since we call them with _pc().call_function().
I've added the docstrings to respective classes which notes that the API now support calling the Python-native operators directly, as well as provides some examples of that. Please let me know if that's sufficient.

rmnskb · 2025-11-13T21:22:16Z

I think the current scope is solid. I am thinking of basic comparisons and the discussion in the issue around __eq__. That would also be good to tackle but maybe as a separate PR?

I'll be happy to work on it, once I'm done with this implementation :)

Yes, it would be important to update the documentation (User Guide which is the part that is not API reference related).

Are you talking about this file? Or is there somewhere else, where I can put this information?

One comment after looking at the code would be to add tests that cover various type combinations (array, scalar, Python types, unsupported types ...)

Yes, I agree. Are there any particular unsupported types I should include that you have in mind?

AlenkaF · 2025-11-14T10:59:12Z

I'll be happy to work on it, once I'm done with this implementation :)

Thank you!

Are you talking about this file? Or is there somewhere else, where I can put this information?

I was thinking more about the compute page in our User Guide: https://github.com/apache/arrow/blob/main/docs/source/python/compute.rst

Are there any particular unsupported types I should include that you have in mind?

No, not really. What comes to mind are strings and/or nested for some arithmetic functions.

rmnskb · 2025-11-14T14:48:31Z

Are you talking about this file? Or is there somewhere else, where I can put this information?

I was thinking more about the compute page in our User Guide: https://github.com/apache/arrow/blob/main/docs/source/python/compute.rst

I mentioned the newly implemented operators in the documentation. Please let me know if it makes sense. I'm not sure whether we should list all the implemented methods, on the other hand, I don't want to leave users guessing what exactly can they use.

rok · 2025-11-17T09:32:07Z

One comment after looking at the code would be to add tests that cover various type combinations (array, scalar, Python types, unsupported types ...)

Yes, I agree. Are there any particular unsupported types I should include that you have in mind?

An integer based extension type maybe.

rmnskb added 2 commits November 8, 2025 01:50

Add arithmetic dunders for pa.Array

295bee0

Cover pa.Array dunders with a test

d8bd508

github-actions bot added Component: Python awaiting review Awaiting review labels Nov 8, 2025

rmnskb added 3 commits November 9, 2025 22:43

Check the compute functions to their checked counterparts

e9219b9

Add arithmetic dunders to scalars

74aabc3

Cover scalar arithmetic dunders with tests

14d727e

rmnskb added 4 commits November 12, 2025 21:54

Add bitwise and math dunders to array

76f031e

Add bitwise and math dunders to scalar

9ac3b49

Cover newly added dunders with tests

bc7aafc

Add docstrings for arrays and scalar dunders

3d99be6

rmnskb added 2 commits November 14, 2025 13:08

Fix docstrings errors

9ce8cd8

Mention the implemented operators in the docs

1110f09

github-actions bot added the Component: Documentation label Nov 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GH-32007 [Python] Support arithmetic on arrays and scalars #48085

GH-32007 [Python] Support arithmetic on arrays and scalars #48085

rmnskb commented Nov 8, 2025 •

edited by github-actions bot

Loading

Uh oh!

rmnskb commented Nov 8, 2025

Uh oh!

rok commented Nov 11, 2025

Uh oh!

rmnskb commented Nov 11, 2025

Uh oh!

AlenkaF commented Nov 12, 2025

Uh oh!

rok commented Nov 12, 2025

Uh oh!

AlenkaF commented Nov 12, 2025

Uh oh!

rok commented Nov 12, 2025

Uh oh!

rmnskb commented Nov 13, 2025

Uh oh!

rmnskb commented Nov 13, 2025

Uh oh!

AlenkaF commented Nov 14, 2025

Uh oh!

rmnskb commented Nov 14, 2025

Uh oh!

rok commented Nov 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

GH-32007 [Python] Support arithmetic on arrays and scalars #48085

Are you sure you want to change the base?

GH-32007 [Python] Support arithmetic on arrays and scalars #48085

Conversation

rmnskb commented Nov 8, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

rmnskb commented Nov 8, 2025

Uh oh!

rok commented Nov 11, 2025

Uh oh!

rmnskb commented Nov 11, 2025

Uh oh!

AlenkaF commented Nov 12, 2025

Uh oh!

rok commented Nov 12, 2025

Uh oh!

AlenkaF commented Nov 12, 2025

Uh oh!

rok commented Nov 12, 2025

Uh oh!

rmnskb commented Nov 13, 2025

Uh oh!

rmnskb commented Nov 13, 2025

Uh oh!

AlenkaF commented Nov 14, 2025

Uh oh!

rmnskb commented Nov 14, 2025

Uh oh!

rok commented Nov 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rmnskb commented Nov 8, 2025 •

edited by github-actions bot

Loading