Support multiple replica sizes per scale in cluster spec sheet by antiguru · Pull Request #36604 · MaterializeInc/materialize

antiguru · 2026-05-18T09:10:57Z

Motivation

The cluster spec sheet benchmarking tool currently tests only a single replica size per scale. This change enables testing multiple replica sizes at each scale, allowing side-by-side comparison of equivalent cluster configurations (e.g., Ncc vs M.1-* sizes with the same worker count).

Description

This PR refactors the replica size selection logic to support multiple sizes per scale:

API Change: Renamed replica_size_for_scale() to replica_sizes_for_scale() which now returns list[str] instead of a single string. Added a new replica_size_for_scale() convenience method that returns the first (default) size for backward compatibility with setup-only clusters.
ManagedTarget Implementation: Updated to return both Ncc and equivalent M.1-* sizes where available. Added M1_REPLICA_SIZES mapping that pairs each scale with its corresponding M.1 tier based on worker count equivalence (e.g., scale 2 → M.1-small with 4 workers).
DockerTarget Implementation: Updated to return a single-element list for consistency.
Scenario Execution: Modified run_scenario_strong() and run_scenario_weak() to iterate over all returned replica sizes, running each scenario once per size. This allows direct performance comparison between equivalent cluster configurations.
Analysis: Enhanced analyze_cluster_results_file() with M.1 size metadata (credits/hour and worker counts) to properly normalize results across different cluster types when analyzing benchmark data.

Verification

The changes maintain backward compatibility through the new replica_size_for_scale() convenience method. Existing tests and benchmarks will continue to work. The new functionality is exercised by the scenario execution loops which now iterate over multiple sizes per scale.

https://claude.ai/code/session_016ELD471W47RyJdWuoLrUc1

… cc sizes CloudTarget now exposes a list of equivalent (cc, M.1) sizes per replica scale, and the strong/weak scaling runners iterate over both so each scale point produces a data point on both pricing schemes. The cluster result analyzer learns to map M.1 names to credits/hour and worker counts for plotting alongside cc sizes.

antiguru marked this pull request as ready for review May 18, 2026 09:14

antiguru requested review from aljoscha, def- and ggevay and removed request for ggevay May 18, 2026 09:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support multiple replica sizes per scale in cluster spec sheet#36604

Support multiple replica sizes per scale in cluster spec sheet#36604
antiguru wants to merge 1 commit into
mainfrom
claude/restore-m1-cluster-sizes-0g84M

antiguru commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

antiguru commented May 18, 2026

Motivation

Description

Verification

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants