Action value exceeds the boundry of the final layer activation fucntion of the actor

1 vue (au cours des 30 derniers jours)

Afficher commentaires plus anciens

awcii le 17 Juin 2023

0
Lien

Utiliser le lien direct vers cette question

https://fr.mathworks.com/matlabcentral/answers/1984659-action-value-exceeds-the-boundry-of-the-final-layer-activation-fucntion-of-the-actor

Commenté : awcii le 19 Juin 2023

Ouvrir dans MATLAB Online

Hi,

I'm using DDPG agent for my RL application with Matlab 2022a version.

I want to take action between 0 and 1 value. To do this, i use SigmoidLayer function at the final layer of the action. However, it exceed the 0-1 boundry. I also tried to use tanh with

scalingLayer(Scale=0.5,Bias=0.5);

,but it exceed the boundry again. How it can be possible?

Meanwhile, i also tried to use

actInfo = rlNumericSpec([1 1],LowerLimit=0,UpperLimit=1);

to limit action, yes it limits the action value but it doesn't scale it. it just act as a saturation block (like putting a saturation block in simulink in front of the action output). So, with this way, the RL works wrong.

How can achive to take action between 0 and 1?

3 commentaires
Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien

awcii le 18 Juin 2023

than you for your reply. i solved it by reducing the noise variance now.

awcii le 19 Juin 2023

however, deacreasing the noise variance cause a lack of exploration during training. So, in totaly, i need a new solution.

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Réponses (0)

Connectez-vous pour répondre à cette question.

Catégories

Control Systems Reinforcement Learning Toolbox Policies and Value Functions

En savoir plus sur Policies and Value Functions dans Help Center et File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

Action value exceeds the boundry of the final layer activation fucntion of the actor

3 commentaires
Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien

Réponses (0)

Voir également

Catégories

Tags

Community Treasure Hunt

Action value exceeds the boundry of the final layer activation fucntion of the actor

3 commentaires Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien

Réponses (0)

Voir également

Catégories

Tags

Community Treasure Hunt

3 commentaires
Afficher 1 commentaire plus ancienMasquer 1 commentaire plus ancien