-
Notifications
You must be signed in to change notification settings - Fork 698
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix memleak when input contain large image data
#4610
opened May 21, 2026 by
grimoire
Collaborator
Loading…
cancel in-progress runs when PR is updated or merged
#4609
opened May 21, 2026 by
lvhan028
Collaborator
Loading…
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605
opened May 21, 2026 by
windreamer
Collaborator
Loading…
1 of 4 tasks
fix(vl): reduce multimodal feature memory use
#4603
opened May 20, 2026 by
CUHKSZzxy
Collaborator
Loading…
support qwen3.5(vit) inference in turbomind backend
enhancement
New feature or request
#4602
opened May 20, 2026 by
irexyc
Collaborator
Loading…
[ci] add k4v2 testcase and fix some fail cases
#4601
opened May 20, 2026 by
zhulinJulia24
Collaborator
Loading…
[WIP]: Support reuse routed experts on eviction
#4599
opened May 19, 2026 by
RunningLeon
Collaborator
Loading…
Extend v1/messages by introducing token-in/out and returning routed experts
improvement
#4597
opened May 19, 2026 by
lvhan028
Collaborator
Loading…
fix(pytorch): offload guided decoding CPU ops to thread pool to prevent event loop blocking
improvement
#4590
opened May 18, 2026 by
windreamer
Collaborator
Loading…
3 of 4 tasks
docs(advance): add Add a New Speculative Decoding Method guide
documentation
Improvements or additions to documentation
#4589
opened May 17, 2026 by
SuperMarioYL
Loading…
4 tasks done
Add OpenAI Responses-compatible endpoint
enhancement
New feature or request
#4582
opened May 13, 2026 by
CUHKSZzxy
Collaborator
Loading…
[security] fix(proxy): require auth for node management
#4579
opened May 11, 2026 by
Hinotoi-agent
Loading…
5 of 9 tasks
Fix health latency under concurrent VL request preparation
Bug:P0
#4570
opened May 7, 2026 by
CUHKSZzxy
Collaborator
Loading…
FP8 kv cache quantization
enhancement
New feature or request
#4563
opened Apr 29, 2026 by
CUHKSZzxy
Collaborator
Loading…
[Feature] Add guided decoding support for speculative decoding
enhancement
New feature or request
#4559
opened Apr 28, 2026 by
windreamer
Collaborator
Loading…
4 tasks done
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.