[ROCm] Enable VLLM triton FP8 moe for gfx1201, tuned for Qwen3-30B-A3B-FP8 tp=2 and Qwen/Qwen3.5-35B-A3B-FP8 tp=2 #79
Open
big-yellow-duck wants to merge 3 commits into main