Skip to content

Commit 2f1d411

Browse files
committed
remove unnecessary output
Signed-off-by: Jinzhen Lin <[email protected]>
1 parent 1a06e4f commit 2f1d411

File tree

1 file changed

+1
-3
lines changed

1 file changed

+1
-3
lines changed

vllm/model_executor/layers/quantization/utils/marlin_utils_fp4.py

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -120,11 +120,9 @@ def apply_fp4_marlin_linear(
120120

121121
inputs, a_scales = marlin_quant_input(inputs, torch.float8_e4m3fn)
122122

123-
output = torch.empty(out_shape, dtype=reshaped_x.dtype, device=reshaped_x.device)
124-
125123
output = ops.gptq_marlin_gemm(
126124
a=inputs,
127-
c=output,
125+
c=None,
128126
b_q_weight=weight,
129127
b_bias=bias,
130128
b_scales=weight_scale,

0 commit comments

Comments
 (0)