[spark] Harden dynamic overwrite against optimized child plans by kerwin-zk · Pull Request #8052 · apache/paimon

kerwin-zk · 2026-05-31T12:10:24Z

Purpose

PaimonDynamicPartitionOverwriteCommand exposes its child query to the Spark optimizer through V2WriteCommand, but later wraps the same query back into a Dataset in run() before passing it to WriteIntoPaimonTable. This is fragile when the child query has already been optimized by Spark: the optimized plan may contain optimizer- or planner-side placeholders, such as DynamicPruningSubquery, which are not well suited to being exposed again to writer-side Dataset operations.

This PR makes the command-to-writer boundary more robust for the dynamic partition overwrite fallback path. Before passing the query to WriteIntoPaimonTable, it converts the child query into an RDD-backed DataFrame via createNewDataFrame(createDataset(...)). As a result, the writer consumes a clean logical plan instead of directly consuming the possibly optimized child plan.
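The idea of cutting the writer off from the upstream plan can be sketched with public Spark APIs only. This is a minimal illustration, not the PR's code: `detachFromOptimizedPlan` and `RddBackedBoundarySketch` are hypothetical names, and `spark.createDataFrame(df.rdd, df.schema)` is used as a stand-in for Paimon's internal `createNewDataFrame(createDataset(...))` helpers, whose exact behavior is not shown here. It assumes a classic (non-Connect) Spark session on the classpath.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object RddBackedBoundarySketch {

  // Hypothetical helper (not part of Paimon): rebuilds `df` on top of its RDD,
  // so the returned DataFrame's logical plan is an RDD-backed leaf carrying the
  // same schema, rather than the original operator tree.
  def detachFromOptimizedPlan(spark: SparkSession, df: DataFrame): DataFrame =
    spark.createDataFrame(df.rdd, df.schema)

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("rdd-backed-boundary-sketch")
      .getOrCreate()
    try {
      // A small query with a projection and a filter in its plan.
      val source = spark.range(0, 10)
        .selectExpr("id", "id % 3 AS pt")
        .filter("pt = 1")

      val detached = detachFromOptimizedPlan(spark, source)

      // Inspect the plan the downstream consumer would see.
      println(detached.queryExecution.logical)

      // Schema and data are unchanged by the boundary.
      assert(detached.schema == source.schema)
      assert(detached.count() == source.count())
    } finally {
      spark.stop()
    }
  }
}
```

One trade-off of this stand-in: going through `df.rdd` converts rows to external `Row` objects and hides the upstream plan from any further optimization, so it is only appropriate at a boundary where the downstream consumer should not see that plan anyway.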

Tests

CI

leaves12138 · 2026-05-31T12:51:04Z

Thanks for the update. I am holding off on approval for now because the current CI run has a failing job and several jobs are still pending. Please fix or rerun the failed checks, then I can take another pass.

YannByron · 2026-06-01T03:22:05Z

+1. @kerwin-zk Thank you for digging into such a deep issue. Will merge once CI has passed.

[spark] Harden dynamic overwrite against optimized child plans

7c9d637

kerwin-zk wants to merge 1 commit into apache:master from kerwin-zk:spark-dynamic-overwrite-hardening
