Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Integrate deep-ep nccl backend
#4477 opened Mar 27, 2026 by irexyc Loading…
feat: Turbomind linear gdn prefix caching
#4465 opened Mar 25, 2026 by lapy Loading…
refactor get_ppl improvement
#4461 opened Mar 25, 2026 by lvhan028 Loading…
feat: implement Turbomind vision encoder support for Qwen3VL/3.5 families enhancement New feature or request
#4460 opened Mar 24, 2026 by lapy Loading…
Support multi stop words improvement
#4454 opened Mar 24, 2026 by lvhan028 Loading…
Draft model update params
#4452 opened Mar 24, 2026 by CUHKSZzxy Loading…
[ascend] fix prefix caching
#4448 opened Mar 23, 2026 by yao-fengchen Draft
docs: add gitcgr code graph badge
#4446 opened Mar 22, 2026 by vitali87 Loading…
[WIP]: qwen35 mtp WIP
#4437 opened Mar 20, 2026 by RunningLeon Draft
update h config and add glm4.7 mtp test
#4424 opened Mar 18, 2026 by littlegy Loading…
lmdeploy support kernel block size
#4421 opened Mar 17, 2026 by Tsundoku958 Loading…
[WIP] Support qwen3-omni
#4411 opened Mar 13, 2026 by CUHKSZzxy Draft
2 of 4 tasks
[Ascend] support qwen3.5 27B
#4395 opened Mar 4, 2026 by wanfengcxz Draft
add tool and reasoning test
#4388 opened Mar 2, 2026 by littlegy Loading…
Fix Structured Output for GPT-OSS Models
#4386 opened Mar 2, 2026 by windreamer Loading…
Improve proxy server improvement
#4354 opened Feb 12, 2026 by lvhan028 Loading…
Support MiniMax-M2 in TurboMind engine enhancement New feature or request
#4343 opened Feb 10, 2026 by zh-nj Loading…
[WIP]Support torch compile
#4336 opened Feb 8, 2026 by grimoire Draft
add preliminary support for EP(single-node) of turbomind backend enhancement New feature or request
#4332 opened Feb 6, 2026 by irexyc Loading…
ProTip! Updated in the last three days: updated:>2026-03-25.