refactor: increased code sharing between CPU and GPU interpretation in RNTuple reading #1470

fstrug · 2025-07-17T14:02:18Z

Much of the code used to read RNTuple data via cpu and gpu interpretation is common but contained in separate functions. This means any changes made in one workflow will not be automatically reflected in the other. This pr increases function sharing between the two interpretation modes such that any code improvements are automatically shared. A FieldClusterMetadata class has been added to contain metadata used in the reading and deserializing of raw page data.

I have also changed the arrays() argument use_GDS to interpreter since it is not strictly necessary to have GDS support for gpu interpretation of RNTuple data and is more reflective of what the argument is controlling.

…tation of RNTuple data. Changed arrays() argument 'use_GDS' to 'interpreter'.

ianna

@fstrug - nice! Please check some minor comments. Thanks.

src/uproot/behaviors/RNTuple.py

Co-authored-by: Ianna Osborne <[email protected]>

ariostas

Thank you, Frank! This looks great to me. I like how you factored out some pieces to make the code cleaner.

The only comment I have is that I'm not very sure about the interpreter keyword argument, since I think it's a bit ambiguous. It might make sense to implement decompression_executor and interpretation_executor like how is done for TTrees, but I'm not familiar with those, so I'd have to look into that. So that can just be a follow-up.

At some point we'll have to sit down and fix all the keyword arguments for the common methods (arrays, iterate, etc). But I think it's still okay if we change some of the lesser used keyword arguments while we figure things out and people are not regularly working with RNTuples.

fstrug changed the title ~~fix: increased code sharing between CPU and GPU interpretation of RNTuple reading~~ fix: increased code sharing between CPU and GPU interpretation in RNTuple reading Jul 17, 2025

Code refactoring. Increased function sharing for CPU and GPU interpre…

dc8fd04

…tation of RNTuple data. Changed arrays() argument 'use_GDS' to 'interpreter'.

ariostas force-pushed the main branch from 77574ff to dc8fd04 Compare July 23, 2025 16:56

ianna added the next-release Required for the next release label Jul 24, 2025

ianna requested changes Jul 24, 2025

View reviewed changes

src/uproot/behaviors/RNTuple.py Outdated Show resolved Hide resolved

src/uproot/behaviors/RNTuple.py Outdated Show resolved Hide resolved

src/uproot/behaviors/RNTuple.py Outdated Show resolved Hide resolved

fstrug and others added 3 commits July 24, 2025 17:40

Update src/uproot/behaviors/RNTuple.py

e364ecb

Co-authored-by: Ianna Osborne <[email protected]>

Update src/uproot/behaviors/RNTuple.py

6676dfa

Co-authored-by: Ianna Osborne <[email protected]>

Update RNTuple.py

2f6cce8

fstrug changed the title ~~fix: increased code sharing between CPU and GPU interpretation in RNTuple reading~~ refactor: increased code sharing between CPU and GPU interpretation in RNTuple reading Jul 24, 2025

ariostas approved these changes Jul 25, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor: increased code sharing between CPU and GPU interpretation in RNTuple reading #1470

refactor: increased code sharing between CPU and GPU interpretation in RNTuple reading #1470

fstrug commented Jul 17, 2025

Uh oh!

ianna left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ariostas left a comment

Uh oh!

Uh oh!

refactor: increased code sharing between CPU and GPU interpretation in RNTuple reading #1470

Are you sure you want to change the base?

refactor: increased code sharing between CPU and GPU interpretation in RNTuple reading #1470

Conversation

fstrug commented Jul 17, 2025

Uh oh!

ianna left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ariostas left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!