-
Notifications
You must be signed in to change notification settings - Fork 281
Pull requests: ROCm/aiter
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TRITON] Fix Something isn't working
ci:triton-355
triton
test_moe_routing_sigmoid_top1_fused reference implementation tie-breaking
bug
#2750
opened Apr 15, 2026 by
brunomazzottiamd
Contributor
Loading…
1 task done
Add tuned MoE configs for Qwen3-Next-80B-A3B FP8 on MI355X
#2748
opened Apr 15, 2026 by
nholmber
Loading…
1 task done
fix(car): sglang prefill launch error kernel
#2745
opened Apr 15, 2026 by
TennyWang1223
Contributor
Loading…
1 task
Add glm5 70k 300 triton a8w8 blockscale configs
#2743
opened Apr 14, 2026 by
amd-pedghazi
Loading…
1 task done
fix moe splitk aot and jit
ci:atom
#2738
opened Apr 14, 2026 by
lalala-sh
Contributor
Loading…
1 task
Revert max_size from 1GB to 128MB to fix KV cache regression
#2737
opened Apr 14, 2026 by
AMD-yanfeiwang
Contributor
•
Draft
1 task
introduce g1u0 smoothquant int8 fused moe : fused_moe_gelu_sqi8
#2730
opened Apr 14, 2026 by
tingqli
Loading…
1 task
Add bf16 MLA decode kernel for gqa_ratio=64, qseqlen=1 (non-persistent)
#2729
opened Apr 14, 2026 by
fangche123
Contributor
Loading…
MI350 mla ps mode suppport nhead128,1 128,2 128,3 128,4 64,4 64,2 32,4 through kernel mla_a16w16_qh32_qseqlen4_gqaratio32_ps.co
#2727
opened Apr 14, 2026 by
minmengdie
Contributor
Loading…
1 task
[TRITON] Add unified attention support to bench_models
enhancement
New feature or request
triton
#2724
opened Apr 13, 2026 by
lucas-santos-amd
Contributor
Loading…
1 task
docs: comprehensive documentation overhaul
#2706
opened Apr 12, 2026 by
sunway513
Collaborator
Loading…
4 tasks
feat: add Gemma4 31B support (ProportionalRotaryEmbedding, rmsnorm dtype)
#2705
opened Apr 12, 2026 by
ClementLinCF
Collaborator
Loading…
1 task done
Update quant.pyfix: add pack_dim to per_1x32_f4_quant for tl.dot_scaled RHS compatibility
#2704
opened Apr 12, 2026 by
GeisYaO
Loading…
fea(car): support custom group device
#2703
opened Apr 12, 2026 by
TennyWang1223
Contributor
Loading…
1 task
Add ROCm-versioned wheel naming to release workflow
#2698
opened Apr 11, 2026 by
sunway513
Collaborator
Loading…
3 tasks
Add FlyDSL fused RoPE + KV Cache backend
#2697
opened Apr 11, 2026 by
coderfeli
Collaborator
Loading…
2 of 3 tasks
[Triton] Declare triton>=3.6.0 dependency
ci:triton-355
#2695
opened Apr 10, 2026 by
micmelesse
Contributor
Loading…
1 task
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.