Pull requests: PaddlePaddle/FastDeploy
Pull requests list
[Cherry-Pick][CI] Sync dev optimizations to release/online/20260415(#7602)
#7857
opened May 19, 2026 by
EmmonsCurse
Collaborator
5 tasks done
[KVCache] Add free_cpu_block_num gauge metric
#7856
opened May 19, 2026 by
liyonghua0910
Collaborator
2 of 5 tasks
[Cherry-Pick][KVCache] Support request-level prefix cache disable(#7854)
#7855
opened May 19, 2026 by
kevincheng2
Collaborator
4 of 5 tasks
[KVCache] Support request-level prefix cache disable
#7854
opened May 19, 2026 by
kevincheng2
Collaborator
3 of 5 tasks
[DataProcessor] Refactor and unify text/multimodal processor pipeline
#7853
opened May 19, 2026 by
luukunn
Collaborator
3 of 5 tasks
Support Triton MLA Attention Backend
#7852
opened May 19, 2026 by
chang-wenbin
Collaborator
5 tasks
[Cherry-Pick][Feature] support decode unified attention(#7688)
#7850
opened May 19, 2026 by
lizhenyun01
Collaborator
5 tasks
[Speculative Decoding][Hackathon 10th Spring No.54] hybrid_mtp_ngram end-to-end validation
contributor
External developers
#7849
opened May 19, 2026 by
NKNaN
Contributor
5 tasks done
[Cherry-Pick][Feature][Log]console metrics log for pd disaggregation #7843
#7845
opened May 18, 2026 by
CSWYF3634076
Collaborator
5 tasks done
[Feature] Add server-level token length defaults and input token limit
#7842
opened May 18, 2026 by
luukunn
Collaborator
3 of 5 tasks
[BugFix] Fix attention mask for multimodal models
#7841
opened May 18, 2026 by
TBD1
Collaborator
2 of 5 tasks
[PD] PD send cache via storage & Refine swap_cache_layout op
#7839
opened May 17, 2026 by
juncaipeng
Collaborator
1 of 5 tasks
support MLA overlap-schedule
#7836
opened May 15, 2026 by
chang-wenbin
Collaborator
5 tasks
[unitest] small change in test_deepgemm_precision.py
#7834
opened May 15, 2026 by
zhoutianzi666
Collaborator
5 tasks
Add inner benchmark metrics component
#7831
opened May 15, 2026 by
Deleter-D
Collaborator
5 tasks
[Cherry-Pick][Loader] Add values natural order check to layers grouped validation
#7822
opened May 14, 2026 by
bukejiyu
Collaborator
1 of 5 tasks
Revert "[PD] prepare request in prefill instance by multi threads"
#7821
opened May 14, 2026 by
Jiang-Jia-Jun
Collaborator
Draft
[Others] update flash mask version
#7819
opened May 14, 2026 by
BingooYang
Contributor
5 tasks done
[Feature] Support FP4 communication quantization and dense block_wise_fp8 and moe nvfp4
#7817
opened May 14, 2026 by
lizexu123
Collaborator
5 tasks
[Metax][CI] update metax ci
contributor
External developers
#7812
opened May 14, 2026 by
Tryorish
Contributor
5 tasks
[bugfix] free blocks even if AS write failed
#7807
opened May 13, 2026 by
zccjjj
Contributor
5 tasks
[Others]Benchmark compare skill
contributor
External developers
#7803
opened May 13, 2026 by
Linboyan-trc