chore: bump iceberg-rust dependency to latest [iceberg]#3606

Merged
mbutrovich merged 10 commits into apache:main from mbutrovich:iceberg-rust-update
Feb 27, 2026

@mbutrovich mbutrovich commented Feb 26, 2026

Which issue does this PR close?

N/A.

Rationale for this change

Take advantage of upstream reader performance optimizations.

What changes are included in this PR?

  • Bump the iceberg dependency in Cargo.toml.
  • Add a Parquet metadata size hint that defaults to DataFusion's default (512 * 1024 bytes).
  • Serialize file sizes for Iceberg data files and delete files. Unfortunately, this surfaces an Iceberg bug (iceberg#12554, "Missing size rewrite in rewrite_table_path for delete file"), so we still have to issue a stat() on delete files. I am fixing this in upstream Iceberg.
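
To illustrate why the size hint and the serialized file sizes help, here is a minimal, self-contained sketch. It is not the iceberg-rust or Comet implementation. It relies only on the Parquet layout, where a file ends with a 4-byte little-endian metadata length followed by the magic bytes `PAR1`. When the file size is already known, the reader does not need a stat() to locate the tail, and with a size hint it can fetch the tail in one request and fall back to a second request only if the footer is larger than the hint. The helper names `tail_range`, `footer_len`, and `needs_second_fetch` are hypothetical.

```rust
use std::ops::Range;

/// A Parquet file ends with: [metadata][4-byte LE metadata length]["PAR1"].
const FOOTER_SUFFIX_LEN: u64 = 8;

/// Byte range of the first fetch, given a known file size and a size hint.
/// Hypothetical helper: covers at least the 8-byte suffix, at most the file.
fn tail_range(file_size: u64, hint: u64) -> Range<u64> {
    let want = hint.max(FOOTER_SUFFIX_LEN).min(file_size);
    (file_size - want)..file_size
}

/// Read the metadata length from the last 8 bytes of the fetched tail.
/// Returns None if the tail is too short or the magic bytes are missing.
fn footer_len(tail: &[u8]) -> Option<u64> {
    if tail.len() < FOOTER_SUFFIX_LEN as usize {
        return None;
    }
    let suffix = &tail[tail.len() - 8..];
    if &suffix[4..] != b"PAR1" {
        return None;
    }
    let len = u32::from_le_bytes([suffix[0], suffix[1], suffix[2], suffix[3]]);
    Some(len as u64)
}

/// True when the metadata plus suffix did not fit in the first fetch.
fn needs_second_fetch(fetched: u64, metadata_len: u64) -> bool {
    metadata_len + FOOTER_SUFFIX_LEN > fetched
}

fn main() {
    let hint = 512 * 1024; // same value as the hint added in this PR
    let file_size = 10 * 1024 * 1024; // assumed to come from serialized metadata
    let range = tail_range(file_size, hint);
    println!("first fetch: {:?}", range);

    // Fake 16-byte tail whose suffix claims 100_000 bytes of metadata.
    let mut tail = vec![0u8; 16];
    tail[8..12].copy_from_slice(&100_000u32.to_le_bytes());
    tail[12..].copy_from_slice(b"PAR1");

    let len = footer_len(&tail).expect("valid Parquet suffix");
    let fetched = range.end - range.start;
    println!("second fetch needed: {}", needs_second_fetch(fetched, len));
}
```

In this sketch a delete file with an unknown size would still need a stat() before `tail_range` could be computed, which is the extra call the Iceberg bug above forces for now.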

How are these changes tested?

Existing tests. I also ran IcebergReadFromS3Suite locally (since it's not run in CI) to verify that reading a delete file's size in an object store works.

@mbutrovich

Let's get DF52 in first, then I'll jump to an even newer iceberg-rust commit since they just picked up DF52 as well.

@mbutrovich mbutrovich marked this pull request as draft February 26, 2026 21:40
@mbutrovich mbutrovich marked this pull request as ready for review February 27, 2026 16:27
@mbutrovich mbutrovich requested a review from comphead February 27, 2026 16:29
.with_batch_size(batch_size)
.with_data_file_concurrency_limit(self.data_file_concurrency_limit)
.with_row_selection_enabled(true)
.with_metadata_size_hint(512 * 1024) // Same as DataFusion's default

We probably need this to be user-configurable via Spark params, but that can be done later.
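
As a rough sketch of that follow-up (not part of this PR), the hint could be read from a string-keyed config map with the current 512 KiB value as the fallback. The key name `spark.comet.example.parquet.metadataSizeHint` and the function `metadata_size_hint` are hypothetical, and the `usize` type for the hint is an assumption.

```rust
use std::collections::HashMap;

/// Fallback that mirrors the value hard-coded in this PR (512 KiB).
const DEFAULT_METADATA_SIZE_HINT: usize = 512 * 1024;

/// Hypothetical config key; not an existing Comet or Spark setting.
const HINT_KEY: &str = "spark.comet.example.parquet.metadataSizeHint";

/// Resolve the hint from user config, falling back to the default when the
/// key is absent, unparsable, or zero.
fn metadata_size_hint(conf: &HashMap<String, String>) -> usize {
    conf.get(HINT_KEY)
        .and_then(|v| v.trim().parse::<usize>().ok())
        .filter(|&n| n > 0)
        .unwrap_or(DEFAULT_METADATA_SIZE_HINT)
}

fn main() {
    let mut conf: HashMap<String, String> = HashMap::new();
    println!("default hint: {}", metadata_size_hint(&conf));

    conf.insert(HINT_KEY.to_string(), "1048576".to_string());
    println!("configured hint: {}", metadata_size_hint(&conf));
}
```

The resolved value would then be passed where the literal `512 * 1024` is used today.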

@comphead comphead left a comment

Thanks @mbutrovich. CI is failing, but likely not because of these changes.

@mbutrovich mbutrovich merged commit 1a4eef6 into apache:main Feb 27, 2026
284 of 289 checks passed