How to use the Reinforcement Learning Toolbox in MATLAB to implement delayed reward
I want to implement delayed reward in MATLAB code. For example, I need to wait until the end of the current episode before assigning the reward for each action taken in that episode. How can I achieve this?
Accepted Answer
More Answers (1)
MOHAMMADREZA
on 5 Mar 2025
0 votes
Hi, I am having the same problem. However, I am using the MATLAB helper class (the class template) for the environment. I do not know how to handle the reward so that it is used for updating the parameters only at the end of the episode. More specifically, the class template gives me the step, reset, ... functions. When are the parameters updated? Is it after each call to the step function? I compute the reward in the step function, but I need the parameters to be updated only at the end of the episode.
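One possible way to get this behaviour with the class template is sketched below. It is an illustration under stated assumptions, not a confirmed answer from this thread: the environment keeps a running score in a property, returns a reward of 0 on every non-terminal step, and pays out the accumulated score on the step that ends the episode. The class name `DelayedRewardEnv`, the properties `MaxSteps`, `StepCount`, `State` and `EpisodeScore`, and the toy dynamics are all hypothetical. Note that the environment only reports rewards; when the agent actually updates its parameters is decided by the agent and its options, not by `step`.

```matlab
classdef DelayedRewardEnv < rl.env.MATLABEnvironment
    % Hypothetical toy environment: the reward is withheld until the
    % episode ends, then paid out in a single terminal step.
    properties
        MaxSteps = 10        % assumed fixed episode length
        StepCount = 0        % steps taken in the current episode
        State = 0            % toy scalar state
        EpisodeScore = 0     % score accumulated but not yet paid out
    end

    methods
        function this = DelayedRewardEnv()
            % Specs for a scalar observation and two discrete actions
            obsInfo = rlNumericSpec([1 1]);
            actInfo = rlFiniteSetSpec([-1 1]);
            this = this@rl.env.MATLABEnvironment(obsInfo, actInfo);
        end

        function [Observation, Reward, IsDone, Info] = step(this, Action)
            Info = [];

            % Toy dynamics (assumption): the action shifts the state
            this.State = this.State + Action;
            this.StepCount = this.StepCount + 1;

            % Placeholder per-step score: penalise distance from zero.
            % It is stored, not returned, until the episode is over.
            this.EpisodeScore = this.EpisodeScore - abs(this.State);

            IsDone = this.StepCount >= this.MaxSteps;
            if IsDone
                Reward = this.EpisodeScore;  % delayed reward, paid once
            else
                Reward = 0;                  % nothing during the episode
            end
            Observation = this.State;
        end

        function InitialObservation = reset(this)
            % Clear all per-episode bookkeeping
            this.State = 0;
            this.StepCount = 0;
            this.EpisodeScore = 0;
            InitialObservation = this.State;
        end
    end
end
```

Saved as `DelayedRewardEnv.m`, the class can be checked with `env = DelayedRewardEnv; validateEnvironment(env)` before training. With this design the agent only ever sees a terminal reward, so credit for earlier actions reaches them through the discounted return; whether that is enough depends on the agent and the episode length.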