Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

chore: update deepgemm revision
#4713 opened Jun 26, 2026 by CUHKSZzxy Collaborator Loading…
Respect --server-port in DP mode when proxy-url is set improvement
#4712 opened Jun 26, 2026 by lvhan028 Collaborator Loading…
Multi node ascend
#4711 opened Jun 26, 2026 by jinminxi104 Collaborator Draft
refactor: rename vl package to multimodal BC-breaking
#4710 opened Jun 26, 2026 by CUHKSZzxy Collaborator Draft
replace sync with wait event in h2d
#4709 opened Jun 25, 2026 by grimoire Collaborator Loading…
feat: share multimodal hash helpers
#4704 opened Jun 24, 2026 by CUHKSZzxy Collaborator Loading…
Reprobe once for health request
#4703 opened Jun 24, 2026 by RunningLeon Collaborator Loading…
fix: fail fast on invalid serve parsers
#4701 opened Jun 23, 2026 by CUHKSZzxy Collaborator Loading…
Fix stale prefix cache hit rate metric
#4699 opened Jun 22, 2026 by DavinciEvans Loading…
Optimize TTFT improvement
#4695 opened Jun 22, 2026 by grimoire Collaborator Loading…
1 task done
Remove interactive chat and make inference stateless
#4694 opened Jun 22, 2026 by lvhan028 Collaborator Draft
chore: remove deprecated model support
#4693 opened Jun 22, 2026 by CUHKSZzxy Collaborator Loading…
Support long-context and MTP prefix-cache hits enhancement New feature or request
#4688 opened Jun 17, 2026 by grimoire Collaborator Loading…
fix: gate multimodal preprocessing concurrency
#4687 opened Jun 17, 2026 by CUHKSZzxy Collaborator Loading…
[Improve]: Remove dlblas from lmdeploy improvement
#4682 opened Jun 16, 2026 by RunningLeon Collaborator Loading…
fix: parse multimodal tool messages Bug:P1
#4680 opened Jun 16, 2026 by CUHKSZzxy Collaborator Loading…
Batch invariant support PART1
#4666 opened Jun 10, 2026 by grimoire Collaborator Draft
refactor: unify interleaved MRoPE rotary embedding improvement
#4644 opened Jun 3, 2026 by CUHKSZzxy Collaborator Loading…
feat: add multimodal and preemption metrics
#4640 opened Jun 1, 2026 by CUHKSZzxy Collaborator Loading…
modify save model in lite module improvement
#4624 opened May 26, 2026 by 43758726 Contributor Loading…
feat(turbomind): support priority schedule policy
#4614 opened May 22, 2026 by 4mengy Loading…
3 of 4 tasks
ProTip! Follow long discussions with comments:>50.