Skip to content

Pull requests: PaddlePaddle/FastDeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Cherry-Pick][CI] Sync dev optimizations to release/online/20260415(#7602)
#7857 opened May 19, 2026 by EmmonsCurse Collaborator Loading…
5 tasks done
[KVCache] Add free_cpu_block_num gauge metric
#7856 opened May 19, 2026 by liyonghua0910 Collaborator Loading…
2 of 5 tasks
[Cherry-Pick][KVCache] Support request-level prefix cache disable(#7854)
#7855 opened May 19, 2026 by kevincheng2 Collaborator Loading…
4 of 5 tasks
[KVCache] Support request-level prefix cache disable
#7854 opened May 19, 2026 by kevincheng2 Collaborator Loading…
3 of 5 tasks
[DataProcessor] Refactor and unify text/multimodal processor pipeline
#7853 opened May 19, 2026 by luukunn Collaborator Loading…
3 of 5 tasks
Support Triton MLA Attention Backend
#7852 opened May 19, 2026 by chang-wenbin Collaborator Loading…
5 tasks
[Cherry-Pick][Feature] support decode unified attention(#7688)
#7850 opened May 19, 2026 by lizhenyun01 Collaborator Loading…
5 tasks
[Speculative Decoding]【Hackathon 10th Spring No.54】hybrid_mtp_ngram 端到端验证 contributor External developers
#7849 opened May 19, 2026 by NKNaN Contributor Loading…
5 tasks done
[Cherry-Pick][Feature][Log]console metrics log for pd disaggregation #7843
#7845 opened May 18, 2026 by CSWYF3634076 Collaborator Loading…
5 tasks done
[XPU] fix zmq err catch
#7844 opened May 18, 2026 by cmcamdy Collaborator Loading…
5 tasks
[Feature] Add server-level token length defaults and input token limit
#7842 opened May 18, 2026 by luukunn Collaborator Loading…
3 of 5 tasks
[BugFix] Fix attention mask for multimodal models
#7841 opened May 18, 2026 by TBD1 Collaborator Loading…
2 of 5 tasks
[PD] PD send cache via storage & Refine swap_cache_layout op
#7839 opened May 17, 2026 by juncaipeng Collaborator Loading…
1 of 5 tasks
support MLA overlap-schedule
#7836 opened May 15, 2026 by chang-wenbin Collaborator Loading…
5 tasks
[unitest] small change in test_deepgemm_precision.py
#7834 opened May 15, 2026 by zhoutianzi666 Collaborator Loading…
5 tasks
Add inner benchmark metrics component
#7831 opened May 15, 2026 by Deleter-D Collaborator Loading…
5 tasks
[Cherry-Pick][Loader] Add values natural order check to layers grouped validation
#7822 opened May 14, 2026 by bukejiyu Collaborator Loading…
1 of 5 tasks
[Others] update flash mask version
#7819 opened May 14, 2026 by BingooYang Contributor Loading…
5 tasks done
[Metax][CI] update metax ci contributor External developers
#7812 opened May 14, 2026 by Tryorish Contributor Loading…
5 tasks
[Feature] GPU Model Runner V1
#7810 opened May 13, 2026 by ming1753 Collaborator Draft
5 tasks
[bugfix] free blocks even if AS write failed
#7807 opened May 13, 2026 by zccjjj Contributor Loading…
5 tasks
Triton mla
#7804 opened May 13, 2026 by Linboyan-trc Loading…
[Others]Benchmark compare skill contributor External developers
#7803 opened May 13, 2026 by Linboyan-trc Loading…
ProTip! Updated in the last three days: updated:>2026-05-16.