[branch-4.1](variant) Skip full footer scan when constructing VariantStatsCaculator (#62819)#63072
Open
csun5285 wants to merge 1 commit intoapache:branch-4.1from
Open
[branch-4.1](variant) Skip full footer scan when constructing VariantStatsCaculator (#62819)#63072csun5285 wants to merge 1 commit intoapache:branch-4.1from
csun5285 wants to merge 1 commit intoapache:branch-4.1from
Conversation
…culator (apache#62819) In **vertical compaction**, `SegmentWriter::init()` is called multiple times against the same writer (key columns first, then each value-column group). The footer **accumulates** across calls, so every additional `init()` re-scans an ever-larger footer — including entries from prior `init()`s that the current `column_ids` cannot address. | | Before | After | |---|---|---| | Single construction | O(footer accumulated size) | O(this init's column_ids size) | | N vertical-compaction `init()` calls | **O(N²)** | **O(N)** | Snapshot `_footer.columns_size()` **before** `_create_writers` appends new entries and pass it to `VariantStatsCaculator` as `footer_column_offset`. The constructor only walks `[offset, end)` — its own slice. When performing compaction with 10,000 columns in the variant, this method accounts for around 8% of CPU usage. After changing it to O(N), this section has disappeared from the flame graph. (cherry picked from commit ffefcae)
Contributor
|
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
Contributor
Author
|
run buildall |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
pick from master #62819