Skip to content

Conversation

@yeya24
Copy link
Contributor

@yeya24 yeya24 commented Jan 7, 2026

What this PR does:

This PR tries to store the number of shards in the parquet converter marker as well as in the bucket index. This supports sharded parquet conversion (we don't support it today) in both write path and read path. Read path can tell how many shards there are by looking at the parquet marker to know how many files to read.

Note that to make this PR small I only changed parquet queryable and left parquet store gateway untouched. Ideally, Store Gateway should load bucket index so that it is able to tell how many shards there are for parquet blocks. But today parquet store gateway doesn't sync bucket index at all.

The plan is to add more shard info to the parquet convert marker like min and max metric name for each shard so that we can prune the shards to query based on the metric name as our parquet file is sorted by metric name. That can leave for future implementation.

Which issue(s) this PR fixes:
Fixes #7175

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@yeya24 yeya24 force-pushed the parquet-converter-shards branch from 3282ac3 to 8908b3e Compare January 7, 2026 01:36
Copy link
Member

@SungJin1212 SungJin1212 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Jan 7, 2026
@yeya24 yeya24 merged commit ec124e5 into cortexproject:master Jan 8, 2026
74 of 77 checks passed
@yeya24 yeya24 deleted the parquet-converter-shards branch January 8, 2026 22:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

lgtm This PR has been approved by a maintainer size/L

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Parquet] Write parquet block shard size to parquet marker

2 participants