Unexpected loss reduction using custom training loop in Deep Learning Toolbox
MathWorks Support Team
on 19 Jul 2023
Answered: MathWorks Support Team
on 3 Aug 2023
I have created a custom training loop following the documentation example: https://www.mathworks.com/help/releases/R2023a/deeplearning/ug/train-network-using-custom-training-loop.html
However, since I use the same loss function for training and validation, I have altered the "modelLoss" function so that the call to "forward" happens outside of it. For example:
[Y,state] = forward(net,X);
[loss,gradients] = dlfeval(@modelLoss,net,Y,T);

function [loss,gradients] = modelLoss(net,Y,T)
% Calculate cross-entropy loss.
loss = crossentropy(Y,T);
% Calculate gradients of loss with respect to learnable parameters.
gradients = dlgradient(loss,net.Learnables);
end
Now the training loss does not decrease as expected. How can I resolve this issue?
Accepted Answer
MathWorks Support Team
on 19 Jul 2023
When the "dlgradient" function is used inside a second function that is called by "dlfeval", automatic differentiation is used to calculate the gradients. The "dlfeval" function traces the operations performed on its inputs, so for the gradients to be calculated correctly, the operations that the gradient depends on (in particular the "forward" call) must remain inside the "modelLoss" function called by "dlfeval". When "forward" runs outside of "dlfeval", those operations are not traced, and "dlgradient" cannot differentiate the loss with respect to the network's learnable parameters.
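For example, a minimal sketch of the traced pattern, following the documentation example linked in the question:

function [loss,gradients,state] = modelLoss(net,X,T)
% Run the forward pass inside the traced function so that the
% operations are recorded for automatic differentiation.
[Y,state] = forward(net,X);
% Calculate cross-entropy loss.
loss = crossentropy(Y,T);
% Calculate gradients of loss with respect to learnable parameters.
gradients = dlgradient(loss,net.Learnables);
end

In the training loop, evaluate the model loss and gradients with:

[loss,gradients,state] = dlfeval(@modelLoss,net,X,T);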
For more information on automatic differentiation in Deep Learning Toolbox, please refer to the documentation.
Moving the "forward" call back inside the "modelLoss" function, as shown above, will resolve the issue. Additionally, since the gradient is not required for validation, using the "dlfeval" function to calculate the validation loss introduces unnecessary tracing overhead and decreases performance; the validation loss can be computed directly instead, as in the sketch below.
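A minimal sketch of the validation computation, assuming the validation predictors and targets are held in "XValidation" and "TValidation" (illustrative names):

% Gradients are not needed for validation, so call "predict" directly
% instead of wrapping the computation in "dlfeval".
YValidation = predict(net,XValidation);
lossValidation = crossentropy(YValidation,TValidation);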