Skip to content

Conversation

@horakivo
Copy link
Member

@horakivo horakivo commented Jan 10, 2025

Recently, many Python libraries have integrated Apache Arrow to leverage its high-performance and memory-efficient format for handling large datasets. Given the widespread adoption of Arrow in data science and big data ecosystems, we are adding Apache Arrow support for Series to GraalPy.

This is the simple demo which uses Java Apache implementation library together with GraalPy achieving zero-copy data transfer for Series.

@oracle-contributor-agreement
Copy link

Thank you for your pull request and welcome to our community! To contribute, please sign the Oracle Contributor Agreement (OCA).
The following contributors of this PR have not signed the OCA:

  • PR author: horakivo

To sign the OCA, please create an Oracle account and sign the OCA in Oracle's Contributor Agreement Application.

When signing the OCA, please provide your GitHub username. After signing the OCA and getting an OCA approval from Oracle, this PR will be automatically updated.

If you are an Oracle employee, please make sure that you are a member of the main Oracle GitHub organization, and your membership in this organization is public.

@oracle-contributor-agreement oracle-contributor-agreement bot added the OCA Required At least one contributor does not have an approved Oracle Contributor Agreement. label Jan 10, 2025
@timfel
Copy link
Contributor

timfel commented Feb 11, 2025

FYI, I extended this demo to use a 0.5GB CSV file and compared with and without the Arrow integration:
image

@horakivo horakivo force-pushed the ih/graalpy-apache-arrow-guide branch 2 times, most recently from a1a392c to 7836676 Compare February 14, 2025 10:49
@horakivo horakivo force-pushed the ih/graalpy-apache-arrow-guide branch from 7836676 to dfaa139 Compare February 14, 2025 10:49
@oracle-contributor-agreement oracle-contributor-agreement bot added OCA Verified All contributors have signed the Oracle Contributor Agreement. and removed OCA Required At least one contributor does not have an approved Oracle Contributor Agreement. labels Mar 18, 2025
@horakivo horakivo self-assigned this Mar 18, 2025
@horakivo horakivo force-pushed the ih/graalpy-apache-arrow-guide branch 5 times, most recently from 4ed22aa to 3c64bfe Compare March 18, 2025 15:05
@horakivo horakivo force-pushed the ih/graalpy-apache-arrow-guide branch from 3c64bfe to 9ccf2ec Compare March 18, 2025 15:07
@timfel timfel requested review from fniephaus and steve-s June 12, 2025 09:05
@horakivo horakivo force-pushed the ih/graalpy-apache-arrow-guide branch from bb8b225 to f750807 Compare June 26, 2025 08:08
@horakivo horakivo force-pushed the ih/graalpy-apache-arrow-guide branch from f750807 to 1731eab Compare June 26, 2025 08:42
steve-s
steve-s previously approved these changes Jun 27, 2025
Copy link
Contributor

@steve-s steve-s left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should add a CI job for this. Can be a follow up PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

OCA Verified All contributors have signed the Oracle Contributor Agreement.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants