Reinforcement Learning Toolbox - When does the algorithm train?

Hans-Joachim Steinort

17 Sep 2019

Updated 26 Sep 2019

I am currently using the Reinforcement Learning Toolbox with a DQN agent built into a long-running process simulation.

The maximum step count is currently 8000 steps per episode.

Unfortunately, the documentation seems a little ambiguous to me, so here is my question:

Does the train function of the toolbox train the agent at the end of an episode, or during the episode once the step count exceeds the mini-batch size (as in the baseline algorithms)?

Thank you in advance.

Accepted Answer

Emmanouil Tzorakoleftherakis on 25 Sep 2019

The implementation is based on the algorithm listed here.

Weights are updated at each training time step, not only at the end of an episode.
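To make the per-step schedule concrete, here is a minimal Python sketch (not the toolbox's code) that counts how many critic updates would happen within one episode. The function `count_updates`, its parameters, and the rule that learning starts only once the replay buffer holds one full mini-batch are all assumptions made for illustration; the toolbox's exact start condition may differ.

```python
import random
from collections import deque


def count_updates(steps_per_episode, minibatch_size, buffer_capacity=10_000, seed=0):
    """Count the updates a per-step DQN-style learner performs in one episode.

    Hypothetical helper for illustration only; it does not call any toolbox API.
    """
    rng = random.Random(seed)
    buffer = deque(maxlen=buffer_capacity)  # experience replay buffer
    updates = 0
    for step in range(steps_per_episode):
        # Placeholder experience tuple: (state, action, reward, next_state).
        experience = (step, rng.randint(0, 1), rng.random(), step + 1)
        buffer.append(experience)
        # Assumption: learning starts once one full mini-batch is available.
        if len(buffer) >= minibatch_size:
            batch = rng.sample(list(buffer), minibatch_size)
            assert len(batch) == minibatch_size
            updates += 1  # stands in for one gradient step on the critic
    return updates


if __name__ == "__main__":
    print(count_updates(steps_per_episode=8000, minibatch_size=64))
```

Under these assumptions, an 8000-step episode with a mini-batch size of 64 yields 7937 updates during the episode (one per step from step 64 onward), rather than a single update at the end.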

Hans-Joachim Steinort on 26 Sep 2019

"For each training time step" - that was the line I was looking for (though looking into the source code led me to the same conclusion).

After double-checking the baseline algorithms, I found that they do it the same way.

Thank you for your time!

Version

R2019a
