-
Notifications
You must be signed in to change notification settings - Fork 674
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: implement Turbomind vision encoder support for Qwen3VL/3.5 families
enhancement
New feature or request
#4460
opened Mar 24, 2026 by
lapy
Loading…
[Feature] Support n parameter in /v1/chat/completions and /v1/completions
#4419
opened Mar 17, 2026 by
ziyangliu-666
Loading…
Assign sequential api_server ports when proxy_url is unset
improvement
#4416
opened Mar 16, 2026 by
lvhan028
Loading…
[Fix][Feat] Fix worker sorting with external pg bundles & Support persistent buffer for update_params
#4397
opened Mar 6, 2026 by
CyCle1024
Loading…
Support MiniMax-M2 in TurboMind engine
enhancement
New feature or request
#4343
opened Feb 10, 2026 by
zh-nj
Loading…
add preliminary support for EP(single-node) of turbomind backend
enhancement
New feature or request
#4332
opened Feb 6, 2026 by
irexyc
Loading…
change ascend paged attention from BSH format to TND format for better performace
#4295
opened Jan 27, 2026 by
jinminxi104
•
Draft
Previous Next
ProTip!
Updated in the last three days: updated:>2026-03-25.