Pull requests: jd-opensource/xllm

Pull requests list

feat: update mlu container to 26.04.
#1535 opened May 23, 2026 by phantomlei3 Collaborator Draft
refactor: refact multimodal processor.
#1530 opened May 22, 2026 by wly-115 Collaborator
feat: support MiMo-7B-Base on cuda device.
#1523 opened May 22, 2026 by Dragonliu2018 Contributor
feat: add dcu backend support.
#1522 opened May 22, 2026 by WenQ7
feat: support dumping xllm server flags to json file.
#1518 opened May 22, 2026 by XuZhang99 Collaborator
feat:remove unused func and support deepseek_v4_mtp graph on npu.
#1517 opened May 22, 2026 by panxua Contributor
feat: expose cached token usage in responses.
#1514 opened May 21, 2026 by zhang-minchao Collaborator
refactor: remove xattention one-stage decode path.
#1504 opened May 21, 2026 by LMX-xin Collaborator Draft
feat: enable REC XAttention for Qwen3 MoE on cuda device.
#1500 opened May 20, 2026 by LMX-xin Collaborator
feat: support vae parallel for qwen-image-edit-plus.
#1499 opened May 20, 2026 by shan-chen-feng Collaborator
feat: support customized multimodal preprocess configs.
#1481 opened May 19, 2026 by xanecdotex Collaborator
refactor: remove negative condition when choosing decode or prefill
#1475 opened May 18, 2026 by rauletorresc Contributor
feat: parallelize multimodal decode in request transfer.
#1474 opened May 18, 2026 by wly-115 Collaborator
refactor: split forward inputs from model input params [3 / 3].
#1469 opened May 18, 2026 by RobbieLeung Collaborator
feat: Support Flux2 in other common components
#1463 opened May 15, 2026 by wang-shuibin
bugfix: reduce acl graph memory overhead.
#1457 opened May 15, 2026 by RobbieLeung Collaborator
docs: exporting a draft model from a quantized model.
#1455 opened May 14, 2026 by rauletorresc Contributor