@jinchengchenghh
As described in Gluten issue apache/incubator-gluten#11216, the file https://github.com/apache/incubator-gluten/blob/main/shims/spark40/src/main/scala/org/apache/spark/sql/execution/AbstractFileSourceScanExec.scala is mostly a copy of FileSourceScanExec.

The copy exists because the source class provides inputRDD, but the columnar engine needs only the filePartitions.

What changes were proposed in this pull request?

Extract the file partition computation out of FileSourceScanExec's inputRDD into filePartitions on FileSourceScanLike.
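
The shape of this refactoring can be sketched roughly as below. This is a minimal, self-contained illustration and not the actual Spark code: FilePartition, FakeRDD, ScanLike, RowScan, ColumnarScan, and the grouping rule are all hypothetical stand-ins chosen for the example. The point it shows is that once partition planning lives on the shared trait, a columnar scan can read the partitions without building or copying the RDD-producing class.

```scala
// Simplified stand-ins for illustration only; these are NOT Spark's real classes.
final case class FilePartition(index: Int, files: Seq[String])
final case class FakeRDD(partitions: Seq[FilePartition])

// Hypothetical analogue of FileSourceScanLike: partition planning lives here.
trait ScanLike {
  def files: Seq[String]
  def maxFilesPerPartition: Int

  // Computed once on first access; consumers can read it without touching an RDD.
  // The fixed-size grouping is an assumption made to keep the sketch short.
  lazy val filePartitions: Seq[FilePartition] =
    files
      .grouped(maxFilesPerPartition)
      .zipWithIndex
      .map { case (group, i) => FilePartition(i, group) }
      .toVector
}

// Hypothetical analogue of FileSourceScanExec: the RDD is built from the shared partitions.
final case class RowScan(files: Seq[String], maxFilesPerPartition: Int) extends ScanLike {
  lazy val inputRDD: FakeRDD = FakeRDD(filePartitions)
}

// Hypothetical columnar scan: it reuses filePartitions and never needs inputRDD.
final case class ColumnarScan(files: Seq[String], maxFilesPerPartition: Int) extends ScanLike

object Demo {
  def main(args: Array[String]): Unit = {
    val files = Seq("a.parquet", "b.parquet", "c.parquet")
    val row = RowScan(files, 2)
    val columnar = ColumnarScan(files, 2)
    // Both scans see the same partition plan.
    println(row.inputRDD.partitions == columnar.filePartitions)
  }
}
```

With this layout, the row-based path is unchanged from the caller's point of view, while an external engine only depends on the trait member.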

Why are the changes needed?

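The columnar engine needs the file partition information but not the RDD. Exposing it on FileSourceScanLike means the engine no longer has to maintain a copy of FileSourceScanExec.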

Does this PR introduce any user-facing change?

No

How was this patch tested?

Covered by existing tests.

Was this patch authored or co-authored using generative AI tooling?

No
