perf[buffer]: iteration for fallible operations with validity by joseph-isaacs · Pull Request #8120 · vortex-data/vortex

joseph-isaacs · 2026-05-27T09:59:24Z

Currently use (and arrow) handle fallible operations with scalar (non-SIMD) code.

This PR add a trait and methods to have fast SIMD checked operations (includes cast) but verified else where that checked_add benefits

codspeed-hq · 2026-05-27T10:12:35Z

Merging this PR will improve performance by 16.14%

⚠️

Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

⚡ 6 improved benchmarks
✅ 1259 untouched benchmarks
🆕 10 new benchmarks
⏩ 1 skipped benchmark¹

Performance Changes

	Mode	Benchmark	`BASE`	`HEAD`	Efficiency
🆕	Simulation	`cast_i32_to_u32[65536]`	N/A	832.9 µs	N/A
🆕	Simulation	`cast_u32_to_u8[65536]`	N/A	250.5 µs	N/A
🆕	Simulation	`cast_u16_to_u32[65536]`	N/A	210.6 µs	N/A
⚡	Simulation	`patched_take_10k_dispersed`	316.3 µs	286 µs	+10.61%
⚡	Simulation	`patched_take_10k_first_chunk_only`	302.6 µs	272.3 µs	+11.14%
⚡	Simulation	`patched_take_10k_adversarial`	257.2 µs	226.9 µs	+13.37%
⚡	Simulation	`take_10k_dispersed`	284.8 µs	239.8 µs	+18.76%
⚡	Simulation	`take_10k_first_chunk_only`	271.1 µs	226.2 µs	+19.86%
🆕	Simulation	`map_with_mask_widen_u16_u32[65536]`	N/A	189.6 µs	N/A
🆕	Simulation	`try_map_masked_into_widen_u16_u32[65536]`	N/A	190 µs	N/A
🆕	Simulation	`try_map_into_narrow_u64_u32[65536]`	N/A	424.1 µs	N/A
🆕	Simulation	`try_map_masked_into_narrow_i32_u32[65536]`	N/A	292.3 µs	N/A
🆕	Simulation	`try_map_masked_in_place_narrow_i32_u32[65536]`	N/A	172.7 µs	N/A
🆕	Simulation	`map_with_mask_narrow_u64_u32[65536]`	N/A	387.1 µs	N/A
🆕	Simulation	`lanezip_checked_add_u32[65536]`	N/A	452.7 µs	N/A
⚡	Simulation	`bitwise_not_vortex_buffer_mut[128]`	304.4 ns	246.1 ns	+23.7%

Tip

Curious why this is faster? Comment @codspeedbot explain why this is faster on this PR, or directly use the CodSpeed MCP with your agent.

_{Comparing ji/fast-iter-valid (fc9b5e8) with develop (a2323f1)}

1 benchmark was skipped, so the baseline result was used instead. If it was deleted from the codebase, click here and archive it to remove it from the performance reports. ↩

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

joseph-isaacs · 2026-05-27T15:16:16Z

Open question is where to put this code?

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

robert3005 · 2026-05-27T23:14:22Z

Sounds like we want a crate in between the array and vortex-buffer or this could be a feature flag in vortex-buffer

joseph-isaacs added 10 commits May 27, 2026 15:25

wip

7b5828f

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

wip

85ef2f8

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

wip

5cf469a

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

502a286

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

2f6df63

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

769a258

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

3a30290

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

d2bca93

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

6fd7fc1

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

72bca8b

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

joseph-isaacs force-pushed the ji/fast-iter-valid branch from 4b444dd to 72bca8b Compare May 27, 2026 14:25

joseph-isaacs added 3 commits May 27, 2026 15:44

f

fe34ccb

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

4299cf0

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

8e5945f

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

joseph-isaacs changed the title ~~faster iteration infra~~ perf[buffer]: iteration for fallible operations with validity May 27, 2026

joseph-isaacs marked this pull request as ready for review May 27, 2026 15:13

joseph-isaacs added 8 commits May 27, 2026 16:58

f

e9aac1d

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

d8d5463

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

2556d53

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

aa8a6d1

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

ca2ad88

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

608111c

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

d0a7806

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

f

fc9b5e8

Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>

joseph-isaacs added the changelog/performance A performance improvement label May 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf[buffer]: iteration for fallible operations with validity#8120

perf[buffer]: iteration for fallible operations with validity#8120
joseph-isaacs wants to merge 21 commits into
developfrom
ji/fast-iter-valid

joseph-isaacs commented May 27, 2026 •

edited

Loading

Uh oh!

codspeed-hq Bot commented May 27, 2026 •

edited

Loading

Uh oh!

joseph-isaacs commented May 27, 2026

Uh oh!

robert3005 commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

joseph-isaacs commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codspeed-hq Bot commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging this PR will improve performance by 16.14%

Performance Changes

Footnotes

Uh oh!

joseph-isaacs commented May 27, 2026

Uh oh!

robert3005 commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

joseph-isaacs commented May 27, 2026 •

edited

Loading

codspeed-hq Bot commented May 27, 2026 •

edited

Loading