Deep Q Learning - define an adaptive critic learning rate?

Hello,
at the moment I use Deep Q Learning for process planning, and I would like to use an adaptive critic learning rate to speed up training.
Is there any direct way (or workaround) in DQL to use a learning rate that decreases over the course of training, e.g. depending on the number of epochs/steps?
Thanks in advance and best wishes
Niklas

Accepted Answer

Hi Niklas,
I believe this is currently not supported. This is an interesting use case though - I will inform the development team. Is there any particular model you have in mind that would work well? For example linear/exponential decay, etc.

7 comments

Hello Emmanouil,
thank you for your quick response.
I have no particular model in mind yet, because I only aimed at speeding up my training runs. But I think both linear and exponential decay would work for this and would be great extensions to the DQL implementation.
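The two decay models discussed above can be sketched as plain schedule functions. This is a minimal illustration of the math only, not Reinforcement Learning Toolbox API; the function names `linear_decay` and `exp_decay` and all parameter values are hypothetical.

```python
def linear_decay(lr0, lr_min, step, total_steps):
    """Linearly interpolate from lr0 down to lr_min over total_steps,
    then hold at lr_min for the rest of training."""
    frac = min(step / total_steps, 1.0)
    return lr0 + (lr_min - lr0) * frac

def exp_decay(lr0, decay_rate, step):
    """Multiply the initial rate by decay_rate once per step
    (0 < decay_rate < 1 gives a geometric decrease)."""
    return lr0 * decay_rate ** step

# Example schedules over a hypothetical 100-step training run:
# linear_decay(1e-2, 1e-4, 0, 100)   -> 1e-2 (start of training)
# linear_decay(1e-2, 1e-4, 100, 100) -> 1e-4 (end of training)
# exp_decay(0.1, 0.5, 2)             -> 0.025 (halved twice)
```

In a framework without built-in schedules, such a function would typically be evaluated once per episode or training step and the result written back into the optimizer's learning-rate setting.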
Kind regards
Niklas
I had a similar idea: letting the maximum number of steps per episode increase over time.
For that, there is a way to do it using the IsDone flag and some logic that determines when an episode ends. You should be careful though, since episode rewards will not be directly comparable if the episode lengths keep changing.
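The idea of a step budget that grows over episodes, enforced through the done signal, can be sketched like this. This is a generic illustration, not toolbox code; the function name `is_done` and the parameters `base_max`, `growth`, and `cap` are all made up for the example.

```python
def is_done(step_in_episode, episode_index,
            terminal=False, base_max=50, growth=10, cap=500):
    """End the episode when the environment terminates naturally
    (terminal=True) or when this episode's growing step budget is
    exhausted. The budget starts at base_max and grows by `growth`
    steps per episode, up to `cap`."""
    max_steps = min(base_max + growth * episode_index, cap)
    return terminal or step_in_episode >= max_steps
```

In a MATLAB environment step function the same check would set the IsDone output; the point is only that the cutoff threshold is a function of the episode index rather than a constant.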
Hello Emmanouil,
could you explain a little bit further why different episode lengths could be problematic?
I am asking because I am already using the IsDone flag to control my episode length flexibly.
If, for example, the IsDone flag is activated in the initial episodes of training, that would be fine (imagine stopping an episode if an agent violates some constraint). Eventually though, you would want the agent to learn how to respect these constraints and let the episode terminate naturally. If you, for whatever reason, manually and consistently stop/reduce the episode duration, you are reducing the potential maximum episode reward that can be collected. So, for a more "mature" agent, you may actually be inhibiting its learning potential, meaning that you may see the collected episode reward go down as you reduce the episode duration (which is kind of expected, since there are fewer simulation steps and thus fewer collected rewards). Hope that helps.
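The comparability issue described above can be made concrete with a tiny example: two episodes with the same per-step performance produce very different raw returns if one is cut short, whereas the per-step average stays comparable. The helper name `mean_step_reward` is illustrative only.

```python
def mean_step_reward(rewards):
    """Average reward per step. Unlike the raw episode return
    (sum of rewards), this stays comparable across episodes of
    different lengths."""
    return sum(rewards) / len(rewards)

# Two episodes with identical per-step reward of 1.0:
short_episode = [1.0] * 10    # cut short after 10 steps
long_episode = [1.0] * 100    # allowed to run 100 steps

# Raw returns differ by 10x (10 vs 100) even though the policy
# performed identically per step; the per-step averages match.
```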
Thanks for your advice, I think I got your point!
Magnify on 29 Jul 2020
Edited: Magnify on 29 Jul 2020
One more question: why is the frequency of the agent's action outport 0.05 s rather than the 0.025 s specified by the agent sample time in my script createDDPGAgent.m? Moreover, there seems to be no way to modify it. [Screenshot of the sample time display attached.] I would appreciate any tips.



Products

Version

R2020a
