Add SPEED-Bench reference synthetic AL values for DeepSeek-V4-Pro MTP 1-8 by qiching · Pull Request #1592 · SemiAnalysisAI/InferenceX

qiching · 2026-05-30T07:11:06Z

Measured with SPEED-Bench coding dataset, temperature=1.0, thinking=true. Values used for synthetic acceptance rate configuration in MTP benchmarks.

AL Values

MTP	AL
1	1.90
2	2.60
3	2.97
4	3.04
5	3.13
6	3.08
7	3.13
8	3.12
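
One way to read the table is to look at how much AL each additional speculative token buys. The following is a minimal Python sketch with the values copied from the table above; the names `AL_BY_MTP` and `marginal_gains` are illustrative and do not come from the PR.

```python
# AL values copied from the table above, keyed by MTP level.
AL_BY_MTP = {1: 1.90, 2: 2.60, 3: 2.97, 4: 3.04, 5: 3.13, 6: 3.08, 7: 3.13, 8: 3.12}


def marginal_gains(al_by_mtp):
    """Return {mtp: AL(mtp) - AL(previous level)} for every level after the first."""
    levels = sorted(al_by_mtp)
    return {
        k: round(al_by_mtp[k] - al_by_mtp[prev], 2)
        for prev, k in zip(levels, levels[1:])
    }


gains = marginal_gains(AL_BY_MTP)
for mtp, gain in gains.items():
    print(f"MTP {mtp}: {gain:+.2f}")
```

The gain falls below 0.1 from MTP 4 onward, and AL is not monotonic (MTP 6 is lower than MTP 5). The table alone does not say whether that dip is measurement noise.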

Note

Low Risk
Documentation-style benchmark reference data only; no runtime or production code paths change.

Overview
Adds benchmarks/speedbench-reference-al.yaml with measured acceptance length (AL) targets for DeepSeek-V4-Pro at MTP levels 1–8, keyed by num_speculative_tokens.

Values come from SPEED-Bench (coding dataset, temperature 1.0, thinking on) and are intended as golden AL inputs for synthetic acceptance rate setup in MTP benchmarks.
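
The PR does not show how an AL target is turned into a synthetic acceptance rate. As a rough sketch under stated assumptions, suppose each of the k draft tokens is accepted independently with the same probability p, and that AL counts the one token the target model always emits (consistent with AL = 1.90 at MTP 1). Then AL = 1 + p + p^2 + ... + p^k, and p can be recovered by bisection. The function names below are hypothetical.

```python
def expected_al(p, k):
    """Expected acceptance length for k draft tokens, each accepted i.i.d. with probability p.

    Includes the token the target model always emits, so the result lies in [1, k + 1].
    """
    return sum(p ** i for i in range(k + 1))


def acceptance_prob_for_al(target_al, k, tol=1e-9):
    """Bisect for p in [0, 1] such that expected_al(p, k) matches target_al."""
    if not 1.0 <= target_al <= k + 1:
        raise ValueError(f"AL must be in [1, {k + 1}] for k={k}, got {target_al}")
    lo, hi = 0.0, 1.0
    # expected_al is increasing in p on [0, 1], so bisection converges.
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if expected_al(mid, k) < target_al:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# Example: MTP 1 with AL 1.90 gives p close to 0.9 under this model.
p_mtp1 = acceptance_prob_for_al(1.90, 1)
```

Real acceptance is usually position-dependent, so a single p is only a first approximation; the benchmark's own configuration may use a different mapping.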

Reviewed by Cursor Bugbot for commit 1de285d. Bugbot is set up for automated code reviews on this repo.

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

xinli-sw · 2026-05-30T07:18:28Z

@qiching, a few recommendations:

Title: [1/N] Synthetic MTP - Add SPEED-Bench reference synthetic AL values for DeepSeek-V4-Pro MTP 1-8
In the PR description, mention that SPEED-Bench will also run as part of the GitHub workflows; for this first iteration, the goal is alignment, so that all partners are equally confident in these AL values.
Attach a full repro for the numbers (serve command, installation, SPEED-Bench command, etc.).
Attach the full results of your runs (the JSONL file) for auditability.

Great work so far, cc @benchislett @functionstackx

Add SPEED-Bench reference AL values for DeepSeek-V4-Pro MTP 1-8

1de285d

qiching requested a review from a team May 30, 2026 07:11

github-project-automation Bot added this to InferenceMAX Board May 30, 2026

claude Bot reviewed May 30, 2026

qiching wants to merge 1 commit into SemiAnalysisAI:main from qiching:albecheng/add-dsv4-reference-al

2 participants