As per my understanding, in the 2nd model the forward pass runs without the sigmoid, which leads to a different loss. That in turn changes the backpropagation performed on the loss, and therefore the gradient steps. Hence, the results are different. (In this model, the loss at each step is not calculated on the "sigmoid"-generated forward pass like the previous model.) In the 1st model you use the sigmoid only to get the y_pred values, and no backpropagation is performed through it. It is just used to measure the accuracy.
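To make the point concrete, here is a minimal sketch in plain Python. It is not the code from the question, which is not shown above. It assumes a one-weight model `w * x` and a squared-error loss purely for illustration. The function names and the values of `x`, `y`, and `lr` are all made up. The only thing it tries to show is that putting the sigmoid inside the loss changes the gradient, while applying it afterwards to read off predictions does not.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def grad_sigmoid_in_loss(w, x, y):
    # Loss = (sigmoid(w*x) - y)**2. The chain rule passes through the
    # sigmoid, which contributes the extra factor s * (1 - s).
    s = sigmoid(w * x)
    return 2.0 * (s - y) * s * (1.0 - s) * x

def grad_sigmoid_outside_loss(w, x, y):
    # Loss = (w*x - y)**2. The sigmoid is not part of the loss,
    # so it never appears in the gradient.
    return 2.0 * (w * x - y) * x

def train(grad_fn, w=0.0, x=2.0, y=1.0, lr=0.1, steps=100):
    # Plain gradient descent on a single weight (hypothetical settings).
    for _ in range(steps):
        w -= lr * grad_fn(w, x, y)
    return w

w_a = train(grad_sigmoid_in_loss)       # sigmoid is part of the loss
w_b = train(grad_sigmoid_outside_loss)  # sigmoid only used afterwards

# Applying the sigmoid here only turns the raw output into a prediction.
# It has no effect on the weights that were already learned.
pred_a = sigmoid(w_a * 2.0)
pred_b = sigmoid(w_b * 2.0)

print("weight with sigmoid in the loss:     ", w_a)
print("weight with sigmoid outside the loss:", w_b)
```

Both runs start from the same weight and see the same data, yet they end at different weights because each step follows a different gradient. The same reasoning would apply to whichever loss function the original code used.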
-
and then
produces different results than
and
Why is that?