When training an agent using the Reinforcement Learning Toolbox, how can I use a custom stopping criterion?

Question

DHRUV LAAD le 2 Jan 2020

2
Lien

Utiliser le lien direct vers cette question

https://fr.mathworks.com/matlabcentral/answers/498677-when-training-an-agent-using-the-reinforcement-learning-toolbox-how-can-i-use-a-custom-stopping-cri

Commenté : goc3 le 14 Juil 2020

The current options only allow for 5 predefined choices ("AverageSteps", "AverageReward", "EpisodeReward", "GlobalStepCount", "EpisodeCount"). I want to include a stopping criterion different from these. Is there any option to do the same?

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

goc3 le 14 Juil 2020

I was about to ask a similar question... The "accepted" answer below doesn't actually answer the question—instead, it confirms that those are the only available stop criteria.

It would be great if additional options and/or support for custom stopping criteria were added.

As an example, for a particular application, I would like to stop training once the episode reward plateaus. It is not known beforehand at what value it will plateau, so having to set a constant before training is very limiting for any application that is programmed to be dynamic or to proceed automatically based on training results.

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Answer 1

Rajani Mishra le 6 Jan 2020

0
Lien

Utiliser le lien direct vers cette réponse

https://fr.mathworks.com/matlabcentral/answers/498677-when-training-an-agent-using-the-reinforcement-learning-toolbox-how-can-i-use-a-custom-stopping-cri#answer_408842

trainOpts = rlTrainingOptions(Name,Value) creates an option set for training using specified name-value pairs.

Arguments like - 'StopTrainingCriteria', 'StopTrainingValue', 'MaxEpisodes' should be specified for defining stopping criterion while training an agent.

StopTrainingCriteria: Specifies the termination condition. Takes one of the choices as you have mentioned

StopTrainingValue: Specifies the Critical value of training termination condition. Training terminates when the termination condition specified by the StopTrainingCriteria option equals or exceeds this value

MaxEpisodes: Specifies maximum number of episodes to train the agent, once the number of episodes reached training terminates

For more information please refer to

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

Tuwe Löfström le 13 Juil 2020

So there is no way of adding a custom stopping criteria, in a similar way as you can define custom reset and step functions?

Connectez-vous pour commenter.

When training an agent using the Reinforcement Learning Toolbox, how can I use a custom stopping criterion?

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

Réponses (1)

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

Voir également

Catégories

Tags

Produits

Version

Community Treasure Hunt

When training an agent using the Reinforcement Learning Toolbox, how can I use a custom stopping criterion?

1 commentaire Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

Réponses (1)

1 commentaire Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

Voir également

Catégories

Tags

Produits

Version

Community Treasure Hunt

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens