Reinforcement Learning toolbox step function

Question

Mostafa Nazmi le 7 Sep 2020

0
Lien

Utiliser le lien direct vers cette question

https://fr.mathworks.com/matlabcentral/answers/590056-reinforcement-learning-toolbox-step-function

Commenté : Kamalova Albina le 21 Fév 2022

Greetings everyone, I hope you're having a good time. In reinforcement learning toolbox there's a functin named "step(env, Action)", I wanted to know what is the role of the input "Action" in this function?

[Observation, Reward, IsDone, LoggedSignals] = step(env, Action)

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Answer 1

Stephan le 7 Sep 2020

0
Lien

Utiliser le lien direct vers cette réponse

https://fr.mathworks.com/matlabcentral/answers/590056-reinforcement-learning-toolbox-step-function#answer_490942

Modifié(e) : Stephan le 7 Sep 2020

The action the agent has choosen in the last step, usually has an impact on the environment. To let the step function know what action was choosen the step before, you have to refer the last action to the next call of the step function, which then - based on this informations calculates the next observation, the reward and the iSDone flag.

See this example:

https://de.mathworks.com/help/reinforcement-learning/ug/create-custom-reinforcement-learning-environment-in-matlab.html

In the example given in the link above the action is a directed force that is applied to the system in the following step to calculate the new observations from the current step.

Building on that the step function can calculate the reward and if the IsDone value is true. Using these informations the agent gets a new information from the environment, which is the basis for the choice of the next action.

3 commentaires
Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien

Maha Mosalam le 22 Nov 2021

Hi, what about the xact role of IsDone flag it it shuld be true or false or what?

Kamalova Albina le 21 Fév 2022

IsDone flag means the episode is finished or not. It should have a condition logic. For example, let's say you are hungry and you decide to eat something. In step function, you are continuously eating while do the actions to choose fry potato or tomato (maybe). How to know you are done and full already?! IsDone is this flag for showing you should stop this eating episode

Connectez-vous pour commenter.

Reinforcement Learning toolbox step function

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Réponse acceptée

3 commentaires
Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien

Plus de réponses (0)

Voir également

Catégories

Tags

Produits

Version

Community Treasure Hunt

Reinforcement Learning toolbox step function

0 commentaires Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

Réponse acceptée

3 commentaires Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien

Plus de réponses (0)

Voir également

Catégories

Tags

Produits

Version

Community Treasure Hunt

0 commentaires
Afficher -2 commentaires plus anciensMasquer -2 commentaires plus anciens

3 commentaires
Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien