-
In chapter 4.1, the model output is:
-
Yeah, it seems the logits are computed slightly differently in newer PyTorch versions. I am using
-
Does this difference at the initial stage matter, or will it diminish as training goes on?
I get the same output on Ubuntu as on Windows, since they both use MKL (Intel Math Kernel…
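
For anyone wondering how the same weights and inputs can give slightly different logits at all: one likely cause (an assumption here, not something confirmed in this thread) is that different library versions or math backends add up the same numbers in a different order, and floating-point addition is not associative. Below is a minimal pure-Python sketch of that effect, not PyTorch code; the helper names `sum_forward` and `sum_backward` are made up for illustration.

```python
import math

def sum_forward(values):
    """Accumulate left to right (hypothetical helper, for illustration only)."""
    total = 0.0
    for v in values:
        total += v
    return total

def sum_backward(values):
    """Accumulate right to left: same numbers, different order."""
    total = 0.0
    for v in reversed(values):
        total += v
    return total

values = [0.1, 0.2, 0.3]
a = sum_forward(values)   # (0.1 + 0.2) + 0.3
b = sum_backward(values)  # (0.3 + 0.2) + 0.1

print(a, b)
print("bitwise equal:", a == b)
print("close within tolerance:", math.isclose(a, b, rel_tol=1e-9))
```

The two sums differ only in the last bits, so it is usually more meaningful to compare outputs with a tolerance than to expect the printed digits to match exactly.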