custom deep learning training loop: gradient computation using dlgradient

Question

Niko Picello le 13 Mai 2021

0
Lien

Utiliser le lien direct vers cette question

https://fr.mathworks.com/matlabcentral/answers/829203-custom-deep-learning-training-loop-gradient-computation-using-dlgradient

Commenté : Niko Picello le 14 Mai 2021

I'm trying to train a CNN with semi supervised learning but i can't evaluate the automatic gradient properly: in particular when i call the function dlgradient (with loss and net.Learnables as parameters) the program invokes other functions inside it and when it's the time of backwardTape (which is also the method that, using other nested functions, is able to compute the gradient) the program fails! it happens that backwardTape is just skipped by the program (actually it gives the output grad, but if i try to step in with the debugger, i can't and it jump to the next line of the code instead); the line is:

grad = backwardTape(tm,{y},{initialAdjoint},x,retainData,false);

in backwardPass.m of the deep learning toolbox. The output grad is just a vector of empty arrays

P.S. the dlnetwork i have created is based on alexnet using transfer learning.

part of the code of interest is:

    loss = labeledLoss + unlabeledLoss; %this two statements are inside a training loop
    gradients = dlfeval(@computeModelGradients,net,loss);
function gradients = computeModelGradients(network,loss)
    gradients = dlgradient(loss,network.Learnables);
end
%where: 
%studentNet is a 1x1 dlNetwork of 24 layers (of which 22 are from alexnet
%and the last 2 are a fully connected and a softmax) 
%loss is 1x1 dlArray (which contain a double)

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Answer 1

Mohamed Marei le 14 Mai 2021

1
Lien

Utiliser le lien direct vers cette réponse

https://fr.mathworks.com/matlabcentral/answers/829203-custom-deep-learning-training-loop-gradient-computation-using-dlgradient#answer_700073

Ouvrir dans MATLAB Online

I think I ran into a similar problem when attempting to train a ResNet-18-based model for transfer learning, too. I had to hard-code my evaluation and update step which was by no means straightforward.

In your case, you might want to compute the loss inside the call to dlfeval.

function [loss, gradients] = computeModelGradients(network, pred_labelled, tgts_labelled, pred_unlabelled)
    labelled_loss = crossentropy(predictions_labelled, targets_labelled); % your loss definition here
    unlabelled_loss =  myfunction(pred_unlabelled); % your loss function for the unlabeled predictions
    loss = labelled_loss + unlabelled_loss;
    gradients = dlgradient(loss, network);
end

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

Niko Picello le 14 Mai 2021

it works, thank you very much!

Connectez-vous pour commenter.

custom deep learning training loop: gradient computation using dlgradient

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Réponses (1)

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

Voir également

Catégories

Tags

Produits

Version

Community Treasure Hunt

custom deep learning training loop: gradient computation using dlgradient

0 commentaires Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Réponses (1)

1 commentaire Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

Voir également

Catégories

Tags

Produits

Version

Community Treasure Hunt

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens