Pull requests: SemiAnalysisAI/InferenceX
Pull requests list
[AMD] Add DSv4 FP4 MI355X vLLM benchmark
sweep-enabled
#1282
opened May 5, 2026 by
Oseltamivir
Collaborator
5 tasks
[WIP][NV] Qwen3.5 fp4 b200 trt
full-sweep-enabled
NVIDIA
#1280
opened May 4, 2026 by
hshrivastava-droid
Collaborator
[BLOCKED BY TRT PR] Add DSv4 B200 TRT
full-sweep-enabled
#1277
opened May 4, 2026 by
Oseltamivir
Collaborator
Tune MiniMax MI355X vLLM scheduling thresholds
#1276
opened May 4, 2026 by
jiacao-amd
Collaborator
Bump actions/setup-python from 5 to 6 in the github-actions group
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#1274
opened May 4, 2026 by
dependabot
Bot
[codex] Use PR41015 fix setup for GB200 MTP2 high throughput
full-sweep-enabled
#1273
opened May 4, 2026 by
alec-flowers
Collaborator
[WIP] Updated DSv4 vllm B300 MTP
full-sweep-enabled
#1271
opened May 4, 2026 by
wzhao18
Collaborator
[DNM - test PR, waiting for nvidia/tensorrtllm to accept the fusedmhc patch] Update DSV4 TRT fused MHC image
full-sweep-enabled
#1270
opened May 3, 2026 by
Oseltamivir
Collaborator
Clean up DSv4 ATOM AITER PR2998 overlay
full-sweep-enabled
#1260
opened May 2, 2026 by
Oseltamivir
Collaborator
1 task
[NV] qwen3.5 fp4 b200 sglang mtp
full-sweep-enabled
NVIDIA
#1257
opened May 1, 2026 by
hshrivastava-droid
Collaborator
[AMD][ROCM] Add MI355X Config: glm5-fp4-mi355x-sglang-mtp
#1254
opened May 1, 2026 by
ChangLiu0709
Collaborator
Draft
4 of 5 tasks
[AMD][ROCM] Fix benchmark_serving Rust Tokenizer Crash via Direct transformers AutoTokenizer
#1253
opened May 1, 2026 by
ChangLiu0709
Collaborator
Draft
3 of 4 tasks
Add SGLANG_OPT_USE_MULTI_STREAM_OVERLAP=1 to SGLang DSv4 launch configs
full-sweep-enabled
#1246
opened May 1, 2026 by
yhyang201
Collaborator
2 tasks
[AMD] Update MI355x Deepseek-R1 FP4 SGLang Image to v0.5.10
#1237
opened Apr 30, 2026 by
ppalanga
Collaborator
[AMD] improve dsr1 fp4 disagg perf on mi355x
AMD
full-sweep-enabled
#1236
opened Apr 30, 2026 by
billishyahao
Collaborator