Skip to content

[ROCm] Enable VLLM triton FP8 moe for gfx1201, tuned for Qwen3-30B-A3B-FP8 tp=2 and Qwen/Qwen3.5-35B-A3B-FP8 tp=2#79

Open
big-yellow-duck wants to merge 3 commits intomainfrom
rdna4-moe
Open

[ROCm] Enable VLLM triton FP8 moe for gfx1201, tuned for Qwen3-30B-A3B-FP8 tp=2 and Qwen/Qwen3.5-35B-A3B-FP8 tp=2#79
big-yellow-duck wants to merge 3 commits intomainfrom
rdna4-moe

Commits

Commits on Mar 28, 2026

Commits on Apr 1, 2026

Commits on Apr 2, 2026