Using Mixed Precision but calculating loss using Full Precision #9132
Unanswered
JoakimHaurum asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Hi,

I am currently training a network for multi-label classification. For this I am using the Asymmetric Loss from Alibaba (https://github.com/Alibaba-MIIL/ASL). However, when I train with mixed precision the loss goes to NaN. This is a known problem (Alibaba-MIIL/ASL#53), and it has been circumvented by calculating the loss in full 32-bit precision while running the rest of the forward pass in mixed precision.

Is this possible with PyTorch Lightning, or should I refactor the loss implementation?
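For reference, a minimal plain-PyTorch sketch of that workaround: a hypothetical `Float32LossWrapper` (not part of the ASL repo) that leaves autocast and upcasts the loss inputs, so only the model forward runs in reduced precision.

```python
import torch
import torch.nn as nn

class Float32LossWrapper(nn.Module):
    """Run any loss in full float32, even when the surrounding
    forward pass is executed under autocast / mixed precision."""

    def __init__(self, loss_fn: nn.Module):
        super().__init__()
        self.loss_fn = loss_fn

    def forward(self, logits: torch.Tensor, targets: torch.Tensor) -> torch.Tensor:
        # Disable autocast for the loss math and upcast its inputs, so the
        # exp/log terms inside the loss cannot overflow in float16.
        with torch.cuda.amp.autocast(enabled=False):
            return self.loss_fn(logits.float(), targets.float())

# Usage, assuming the ASL repo's loss class with its default arguments:
# criterion = Float32LossWrapper(AsymmetricLoss())
```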
Replies: 1 comment

Dear @JoakimHaurum,

Yes, you can wrap the forward of your loss computation in torch.cuda.amp.autocast(enabled=False). Also, we just merged support for bfloat16, which is more stable than float16; it can be enabled through the Trainer's precision argument.

Best,
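Here is a minimal LightningModule sketch of that suggestion, assuming a generic backbone and a loss taking (logits, targets); the class and argument names are illustrative, not part of the Lightning or ASL APIs.

```python
import torch
import pytorch_lightning as pl

class MultiLabelModel(pl.LightningModule):
    def __init__(self, backbone: torch.nn.Module, loss_fn: torch.nn.Module):
        super().__init__()
        self.backbone = backbone
        self.loss_fn = loss_fn  # e.g. the ASL loss

    def training_step(self, batch, batch_idx):
        x, y = batch
        logits = self.backbone(x)  # runs under autocast when precision=16
        # Compute the loss outside autocast, in full float32.
        with torch.cuda.amp.autocast(enabled=False):
            loss = self.loss_fn(logits.float(), y.float())
        self.log("train_loss", loss)
        return loss

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=1e-4)

# Mixed precision is still requested on the Trainer as usual:
trainer = pl.Trainer(gpus=1, precision=16)
# or, with the newly merged bfloat16 support:
# trainer = pl.Trainer(gpus=1, precision="bf16")
```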