-
Notifications
You must be signed in to change notification settings - Fork 180
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Try official TRT-LLM release image 1.3.0rc15.post1 for DSv4 B200/B300 (non-MTP)
sweep-enabled
#1636
opened Jun 1, 2026 by
Oseltamivir
Collaborator
Loading…
feat(power): vendor-agnostic GPU power/telemetry aggregation core
#1635
opened Jun 1, 2026 by
arygupt
Collaborator
Loading…
2 of 3 tasks
Enable Rust frontend (VLLM_USE_RUST_FRONTEND=1)
#1634
opened Jun 1, 2026 by
chunfangamd
Collaborator
Loading…
[AMD] Add DeepSeek-V4-Pro FP4 MI355X SGLang MTP recipe
full-sweep-enabled
#1631
opened May 31, 2026 by
Oseltamivir
Collaborator
Loading…
[AMD] Add DeepSeek-R1-0528 FP8 MI355X ATOM MTP3 benchmark
AMD
full-sweep-enabled
#1628
opened May 31, 2026 by
seungrokj
Collaborator
Loading…
2 tasks
[Klaud Cold] Update gptoss-fp4-mi300x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1621
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update minimaxm2.5-fp8-mi300x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1618
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-int4-mi300x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1615
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-int4-mi325x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1614
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-int4-mi355x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1613
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update minimaxm2.5-fp4-b300-vllm vLLM image to v0.22.0
full-sweep-enabled
#1612
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update minimaxm2.5-fp8-b300-vllm vLLM image to v0.22.0
full-sweep-enabled
#1608
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update gptoss-fp4-h100-vllm vLLM image to v0.22.0
full-sweep-enabled
#1605
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-fp4-b300-vllm vLLM image to v0.22.0
full-sweep-enabled
#1603
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-int4-h100-vllm vLLM image to v0.22.0
full-sweep-enabled
#1601
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-int4-b300-vllm vLLM image to v0.22.0
full-sweep-enabled
#1599
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update dsv4-fp8-h200-vllm (+mtp) vLLM image to v0.22.0
full-sweep-enabled
#1597
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update dsv4-fp4-b200-vllm (+mtp) vLLM image to v0.22.0
full-sweep-enabled
#1596
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update dsv4-fp4-b300-vllm (+mtp) vLLM image to v0.22.0
full-sweep-enabled
#1595
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
Add SPEED-Bench reference synthetic AL values for DeepSeek-V4-Pro MTP 1-8
#1592
opened May 30, 2026 by
qiching
Loading…
ci(disagg): fail before writing result file + surface real failure class
#1591
opened May 29, 2026 by
arygupt
Collaborator
Loading…
fix(process_result): fail loudly on zero-throughput disagg runs (no more masked ZeroDivisionError)
#1590
opened May 29, 2026 by
arygupt
Collaborator
Loading…
[WIP] Update DSv4 B300 vllm image tag
full-sweep-enabled
#1588
opened May 29, 2026 by
wzhao18
Collaborator
Loading…
Add DSV4 GB300 wide-EP sweep configs (EP=12/16/24/32/40)
full-sweep-enabled
#1586
opened May 29, 2026 by
yhyang201
Collaborator
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-05-01.