Inconsistent training loss curve when training UNet with and without convergence criterion

Question

Memo Remo le 22 Avr 2023

1
Lien

Utiliser le lien direct vers cette question

https://fr.mathworks.com/matlabcentral/answers/1951273-inconsistent-training-loss-curve-when-training-unet-with-and-without-convergence-criterion

Modifié(e) : Matt J le 24 Avr 2023

I want to train a semantic segmentation model using UNet layers. I trained the model one time without any convergence criterion and max epoch of 500 and the other time with a convergence criterion (validation patience of 10 and validation frequency of 25). The secnd training process converged at a lower epoch number compared to the first one. The results of the application of the second model is significantly less accurate than the first one on the test slices. I noticed that the training loss in the second training process is different from the first one. My question is that why defining a convergence criterion changes the training loss curve? I would be thankful if any one can help me understand why this is happening. The learning rate is constant and equal to 1e-05.

Training curves without convergence criterion:

Training curves with convergence criterion:

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Answer 1

Matt J le 22 Avr 2023

0
Lien

Utiliser le lien direct vers cette réponse

https://fr.mathworks.com/matlabcentral/answers/1951273-inconsistent-training-loss-curve-when-training-unet-with-and-without-convergence-criterion#answer_1221083

Modifié(e) : Matt J le 24 Avr 2023

There should be a change in the training loss curve - it should be shorter. By loosening the stopping criteria, you cause fewer iterations to be run.

There will also be changes in the training loss curves due to the stochastic nature of the SGD algorithm and the parameter initialization.

3 commentaires
Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien

Matt J le 22 Avr 2023

Modifié(e) : Matt J le 22 Avr 2023

It's not clear to me how strong the differences are. The axes limits are not the same in the 2 plots.

Also, we have not been shown the 3rd plot (on the same axes as the first two) with the training curve you get after reverting back to default settings.

Memo Remo le 24 Avr 2023

Dear Matt,

I appreciate your attention.

I used another approach to train this model for now. I will check this problem again as soon as I can and provide more information. Thank you.

Connectez-vous pour commenter.

Inconsistent training loss curve when training UNet with and without convergence criterion

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Réponses (1)

3 commentaires
Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien

Voir également

Catégories

Tags

Community Treasure Hunt

Inconsistent training loss curve when training UNet with and without convergence criterion

0 commentaires Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Réponses (1)

3 commentaires Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien

Voir également

Catégories

Tags

Community Treasure Hunt

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

3 commentaires
Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien