[SPARK-54566][SQL] Enhance SparkSql FileSourceScanExec for easy integration #53281

jinchengchenghh · 2025-12-02T12:08:49Z

As described in apache/incubator-gluten#11216, the Gluten shim file https://github.com/apache/incubator-gluten/blob/main/shims/spark40/src/main/scala/org/apache/spark/sql/execution/AbstractFileSourceScanExec.scala is mostly a copy of Spark's FileSourceScanExec.

The copy is needed because the source class provides inputRDD, while the columnar engine needs only the final file partitions (finalPartitions).

What changes were proposed in this pull request?

Extract the computation of filePartitions out of FileSourceScanExec inputRDDs and expose it as filePartitions on FileSourceScanLike.
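The shape of this refactor can be illustrated with a minimal, Spark-free sketch. Every name below (ScanLike, RowScan, ColumnarScan, FakeRDD, selectedFiles, maxFilesPerPartition, nativeSplits) is a hypothetical stand-in invented for illustration, and the grouping logic is a placeholder; it is not the actual Spark code or the diff in this PR. It only shows the idea of moving partition planning onto the shared trait so that a columnar implementation can read it without building an RDD.

```scala
// Simplified stand-ins; NOT the real Spark classes or signatures.
final case class FilePartition(index: Int, files: Seq[String])

// Stand-in for an RDD: it just wraps the partitions it would read.
final case class FakeRDD(partitions: Seq[FilePartition])

trait ScanLike {
  // Hypothetical inputs: files selected after pruning, and a packing limit.
  def selectedFiles: Seq[String]
  def maxFilesPerPartition: Int

  // Partition planning lives on the shared trait, so every
  // implementation can use it without constructing an RDD.
  lazy val filePartitions: Seq[FilePartition] =
    selectedFiles
      .grouped(maxFilesPerPartition)
      .zipWithIndex
      .map { case (files, i) => FilePartition(i, files) }
      .toList
}

// Row-based scan: still builds its RDD, now from the shared partitions.
final case class RowScan(selectedFiles: Seq[String], maxFilesPerPartition: Int)
    extends ScanLike {
  lazy val inputRDD: FakeRDD = FakeRDD(filePartitions)
}

// Columnar scan: consumes the partitions directly and never touches inputRDD.
final case class ColumnarScan(selectedFiles: Seq[String], maxFilesPerPartition: Int)
    extends ScanLike {
  def nativeSplits: Seq[String] =
    filePartitions.map(p => s"split-${p.index}:${p.files.mkString(",")}")
}
```

With this layout, an external engine would only need to extend the shared trait and read filePartitions, rather than copying the row-based scan class to reach the same information.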

Why are the changes needed?

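A columnar engine such as Gluten needs the file partition information but not the RDD. Without this change it has to copy most of FileSourceScanExec to obtain it.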

Does this PR introduce any user-facing change?

No

How was this patch tested?

Covered by existing tests.

Was this patch authored or co-authored using generative AI tooling?

No


github-actions bot added the SQL label Dec 2, 2025

jinchengchenghh mentioned this pull request Dec 2, 2025

[CORE] Refactor in Spark to make shims and test easier integration apache/incubator-gluten#11216

Open

[SPARK-54566][SQL] Enhance SparkSql FileSourceScanExec for easy integration

85b6df0

jinchengchenghh force-pushed the refactor branch from 55183ec to 85b6df0 Compare December 2, 2025 18:40
