
Conversation

@ishan-modi
Contributor

What does this PR do?

An error occurs when we check m.weight.is_meta here for a module of type nn.LayerNorm with affine params disabled (elementwise_affine=False), because LayerNorm then initializes its weight as None, see here.

This change checks whether the weight is None and bypasses the module if it is.
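
A minimal sketch of the guard described above, with an illustrative helper name (not the actual modelopt code path):

import torch.nn as nn

def has_real_weight(m: nn.Module) -> bool:
    # Modules such as nn.LayerNorm(..., elementwise_affine=False) register weight as None,
    # so the is_meta check must be skipped for them.
    return getattr(m, "weight", None) is not None

# Sketch of how the guard would be used in the module loop:
# for _, m in model.named_modules():
#     if has_real_weight(m) and m.weight.is_meta:
#         ...  # materialize / load the real weights as before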

meenchen self-assigned this on Jun 9, 2025
@meenchen
Contributor

meenchen commented Jun 9, 2025

Hi @ishan-modi, could you provide a code snippet to produce the error? I want to add some tests to cover the use case.

@ishan-modi
Contributor Author

Thanks for the response @meenchen, please find the code snippet below:

from diffusers import SanaTransformer2DModel
import modelopt.torch.quantization as mtq
from modelopt.torch.quantization.config import FP8_DEFAULT_CFG
from modelopt.torch.opt import enable_huggingface_checkpointing

enable_huggingface_checkpointing()

checkpoint = "Efficient-Large-Model/Sana_600M_1024px_diffusers"
model = SanaTransformer2DModel.from_pretrained(checkpoint, subfolder="transformer")

# Real weight quantization (fake_quant off), input quantizer disabled; then compress and save.
FP8_DEFAULT_CFG['quant_cfg']['*weight_quantizer'].update({'fake_quant': False})
FP8_DEFAULT_CFG['quant_cfg']['*input_quantizer'].update({'enable': False})
mtq.quantize(model, FP8_DEFAULT_CFG)
mtq.compress(model)
model.save_pretrained('test_modelopt_quant')
checkpoint = "./test_modelopt_quant"

# Reloading the compressed checkpoint fails on LayerNorm modules whose weight is None.
model = SanaTransformer2DModel.from_pretrained(checkpoint)

diffusers == 0.33.1
modelopt == 0.31.0
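
For reference, the root cause can also be seen with plain PyTorch, without downloading the checkpoint:

import torch.nn as nn

# LayerNorm without affine params keeps its weight as None,
# which is what trips the is_meta check while loading the compressed checkpoint.
ln = nn.LayerNorm(32, elementwise_affine=False)
print(ln.weight)  # None

try:
    ln.weight.is_meta
except AttributeError as e:
    print(e)  # 'NoneType' object has no attribute 'is_meta'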

@ishan-modi
Contributor Author

@meenchen, any updates on this?

@ishan-modi
Contributor Author

@meenchen @kevalmorabia97, this is a little critical for enabling modelopt in Hugging Face diffusers. Could we release a patch for it? The bug likely also occurs in other generative models that use LayerNorm without affine params.

@kevalmorabia97
Collaborator

Hi @ishan-modi, sorry we couldn't get this into the previous release. We'll include this fix in the upcoming 0.35.0 release, planned in about 2 weeks. Would that timeline be fine?

@ishan-modi
Contributor Author

Thanks @kevalmorabia97! A patch release would be great for faster integration into diffusers, though the next release is fine too.

@kevalmorabia97
Collaborator

The fix is merged in the main branch but not yet in an official release. A temporary workaround is to install modelopt from the main branch instead of the wheels:

pip install git+https://github.com/NVIDIA/TensorRT-Model-Optimizer.git

ishan-modi deleted the fix-null-weights branch on August 3, 2025 07:15
@kevalmorabia97
Collaborator

@ishan-modi the 0.33.1 release is now live, which includes this fix.

@ishan-modi
Contributor Author

Thanks @kevalmorabia97
