Skip to content

lm_head is not converted to QuantLinear with MXFP4/8 #1040

@xin3he

Description

@xin3he

lm_head quantization still have some issues.

  • need deepcopy if tied_word_embedding = True
  • export is not applied for lm_head

Shall we warn user that lm_head is not supported? @WeiweiZhang1 @wenhuach21

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions