Reinforcement Learning Toolbox: DDPG Agent, Q0 =0 during the whole training (more than 5000 iterations)
Afficher commentaires plus anciens
I implemented a DDPG Agent in Matlab's Reinforcement Learning Toolbox with a custom enviroment.
At the beginning I used only a few neurons per hidden layer (8-60) and learning rates between 0.1 and 10 for the critic and actor.
But the problem didn't converges, so I increased the number of neurons per hidden layer (300-400) and decreased the learning rate to about 0.0001.
However, the results are better but it don't converge at all.
But I noticed that the Q0 do not change during the training. Maybe that causes some problems.
Q0 is during the whole training 0. Attached you can find the screenshot of the episode manager.
Somehow, Q0 had changed during the training with a 'old' setup (8-60 neurons per hidden layers and learning rate in between 0.1 and 10)
Does anyone have any idea what went wrong?
Does anyone have any tips for me?
Thanks in advance!
1 commentaire
Rik
le 2 Oct 2020
Why is this thread such a magnet for spam? 1 caught by the spam filter, and 6 not (I'll delete those now as well).
Réponse acceptée
Plus de réponses (0)
Catégories
En savoir plus sur Reinforcement Learning Toolbox dans Centre d'aide et File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!