[spark] Add merge-into.skip-file-pruning option by wzx140 · Pull Request #8065 · apache/paimon

wzx140 · 2026-06-01T13:31:25Z

Purpose

Add merge-into.skip-file-pruning for MergeInto partial column update on data-evolution tables. When enabled, this option skips the file-level pruning step. It is useful when most files in the target partition are expected to be updated, so the overhead of collecting touched file IDs outweighs the benefit of pruning untouched files.

When file pruning is skipped, Spark merge into still pushes down target-table partition filters from the MERGE ON condition to avoid scanning unrelated partitions.

Tests

Added RowTrackingTest cases for enabling/disabling merge-into.skip-file-pruning, including result correctness and file-pruning join behavior.
Added RowTrackingTest coverage for target partition filter pushdown from the MERGE ON condition when skip file pruning is enabled.

[spark] Add merge-into.skip-file-pruning option

9d97131

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[spark] Add merge-into.skip-file-pruning option#8065

[spark] Add merge-into.skip-file-pruning option#8065
wzx140 wants to merge 1 commit into
apache:masterfrom
wzx140:codex/merge-into-skip-file-pruning

wzx140 commented Jun 1, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

wzx140 commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Tests

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

wzx140 commented Jun 1, 2026 •

edited

Loading