Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP]: Support mtp + dp
#4611 opened May 21, 2026 by RunningLeon Collaborator Loading…
fix memleak when input contain large image data
#4610 opened May 21, 2026 by grimoire Collaborator Loading…
cancel in-progress runs when PR is updated or merged
#4609 opened May 21, 2026 by lvhan028 Collaborator Loading…
TEST: update qwen3.5 397b test
#4607 opened May 21, 2026 by littlegy Contributor Loading…
TEST: update video test
#4606 opened May 21, 2026 by littlegy Contributor Loading…
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605 opened May 21, 2026 by windreamer Collaborator Loading…
1 of 4 tasks
Remove state init improvement
#4604 opened May 20, 2026 by grimoire Collaborator Loading…
fix(vl): reduce multimodal feature memory use
#4603 opened May 20, 2026 by CUHKSZzxy Collaborator Loading…
support qwen3.5(vit) inference in turbomind backend enhancement New feature or request
#4602 opened May 20, 2026 by irexyc Collaborator Loading…
[ci] add k4v2 testcase and fix some fail cases
#4601 opened May 20, 2026 by zhulinJulia24 Collaborator Loading…
Intern s2 preview lite awq fix bug
#4600 opened May 19, 2026 by 43758726 Collaborator Loading…
[WIP]: Support reuse routed experts on eviction
#4599 opened May 19, 2026 by RunningLeon Collaborator Loading…
Refactor proxy server improvement
#4596 opened May 18, 2026 by lvhan028 Collaborator Draft
update anthropic endpoint test
#4594 opened May 18, 2026 by littlegy Contributor Loading…
docs(advance): add Add a New Speculative Decoding Method guide documentation Improvements or additions to documentation
#4589 opened May 17, 2026 by SuperMarioYL Loading…
4 tasks done
refactor ascend multinode
#4588 opened May 15, 2026 by yao-fengchen Collaborator Draft
Add OpenAI Responses-compatible endpoint enhancement New feature or request
#4582 opened May 13, 2026 by CUHKSZzxy Collaborator Loading…
[security] fix(proxy): require auth for node management
#4579 opened May 11, 2026 by Hinotoi-agent Loading…
5 of 9 tasks
feat: configure cudagraph capture batch sizes
#4573 opened May 8, 2026 by CUHKSZzxy Collaborator Draft
Fix health latency under concurrent VL request preparation Bug:P0
#4570 opened May 7, 2026 by CUHKSZzxy Collaborator Loading…
LLM evaluation skill on text datasets
#4566 opened Apr 30, 2026 by lvhan028 Collaborator Loading…
FP8 kv cache quantization enhancement New feature or request
#4563 opened Apr 29, 2026 by CUHKSZzxy Collaborator Loading…
[Feature] Add guided decoding support for speculative decoding enhancement New feature or request
#4559 opened Apr 28, 2026 by windreamer Collaborator Loading…
4 tasks done
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.