DQN learns at first but then worsens.

Question

Khandakar Rashid le 20 Avr 2021

0
Lien

Utiliser le lien direct vers cette question

https://fr.mathworks.com/matlabcentral/answers/807947-dqn-learns-at-first-but-then-worsens

Commenté : Emmanouil Tzorakoleftherakis le 23 Avr 2021

Hi, I am training a DQN agent with a simevent model. I am testing out different hyperparameters, but everytime the agent learns (reward goes higher) at first for a while, but then goes down. I have tested different learning rate, exploration epsilon, and discount factors. But the shape of training progress is pretty much same in all combinations. Is there any potential way I can fix this issue?

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Answer 1

Emmanouil Tzorakoleftherakis le 22 Avr 2021

0
Lien

Utiliser le lien direct vers cette réponse

https://fr.mathworks.com/matlabcentral/answers/807947-dqn-learns-at-first-but-then-worsens#answer_682275

Modifié(e) : Emmanouil Tzorakoleftherakis le 22 Avr 2021

To confirm that this is an exploration issue, can you try setting the EpsilonMin param to a high value? e.g. 0.99. If after doing that you still see the same result, there is likely something else going on.

2 commentaires
Afficher AucuneMasquer Aucune

Khandakar Rashid le 23 Avr 2021

Thank you Emmanouil for the suggestion. I have tried Epsilon = 1, EpsilonMin=0.99. Unfortunately, no luck :(

Do you have any other tips?

Emmanouil Tzorakoleftherakis le 23 Avr 2021

Hard to tell, but it's strange to me that the episode curve is similar every time. That makes me think that there is something specific about the way you have modeled your environment model that guides the training through a similar path each time.

Connectez-vous pour commenter.

DQN learns at first but then worsens.

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Réponses (1)

2 commentaires
Afficher AucuneMasquer Aucune

Voir également

Catégories

Tags

Produits

Version

Community Treasure Hunt

DQN learns at first but then worsens.

0 commentaires Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Réponses (1)

2 commentaires Afficher AucuneMasquer Aucune

Voir également

Catégories

Tags

Produits

Version

Community Treasure Hunt

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

2 commentaires
Afficher AucuneMasquer Aucune