Skip to content

Add CUDA graph capture/replay for qwen 3.5 moe decode method#18809

Draft
Gasoonjia wants to merge 23 commits intomainfrom
cuda-graph
Draft

Add CUDA graph capture/replay for qwen 3.5 moe decode method#18809
Gasoonjia wants to merge 23 commits intomainfrom
cuda-graph

Commits

Commits on Apr 2, 2026

Commits on Apr 3, 2026

Commits on Apr 5, 2026

Commits on Apr 6, 2026

Commits on Apr 7, 2026

Commits on Apr 9, 2026

Commits on Apr 10, 2026

Commits on Apr 13, 2026