-
Notifications
You must be signed in to change notification settings - Fork 292
Pull requests: ROCm/aiter
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Re-enable the native qh128 fp8 kernel (mla_a8w8_qh128_m32x4_n16x2_msk…
#2927
opened Apr 28, 2026 by
fangche123
Contributor
Loading…
1 task
ci: enable prebuilt kernel wheels for fork PRs
#2925
opened Apr 27, 2026 by
okakarpa
Collaborator
Loading…
3 tasks
Add GLM-4.7 FP8 tuned and untuned FMOE configs
#2923
opened Apr 27, 2026 by
omirosh
Loading…
2 tasks
[TRITON] Set RNG seed in Triton tests
ci:triton-300x
ci:triton-355
triton
#2921
opened Apr 27, 2026 by
brunomazzottiamd
Contributor
Loading…
1 task done
refactor and unify triton/bench_fav3_sage.py scripts
#2920
opened Apr 27, 2026 by
jcaraban
Loading…
2 tasks
CI: auto-update split test FILE_TIMES
#2918
opened Apr 27, 2026 by
github-actions
Bot
Loading…
2 tasks done
Add a new kernel that supports decode mla with sub_kv=64 and sub_qh=8
#2917
opened Apr 25, 2026 by
JohnNikolay84
Contributor
Loading…
1 task
[Tracking] Release v0.1.13 validation (DO NOT MERGE)
ci:all
#2913
opened Apr 24, 2026 by
sunway513
Collaborator
Loading…
9 tasks
[MLA] Fix qh128 ASM nullptr write; enable native qh128 fp8 on gfx950
#2907
opened Apr 24, 2026 by
inkcherry
Contributor
Loading…
feat(triton/rope): fused QKV split, QK RMSNorm, RoPE, and paged KV cache
#2902
opened Apr 24, 2026 by
hellozhuo-amd
Loading…
1 task
[Bug] Default value of ChunkQ in deepgemm could lead to divided_by_0 error
#2891
opened Apr 24, 2026 by
qli88
Loading…
Fix 2-stage fused_allreduce_rmsnorm memory ordering
#2890
opened Apr 23, 2026 by
hubertlu-tw
Contributor
Loading…
5 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2026-04-24.