- Ensure that “cos(psi(t)) - cos(psi(t-1)) > 0” is met frequently by checking if psi(t) increases over time.
- Verify that the initial conditions of the delay block are properly set to prevent incorrect first-step evaluations.
- Modify the reward function threshold to make the positive reward condition more lenient.
reward error during training
2 vues (au cours des 30 derniers jours)
Afficher commentaires plus anciens
Hello,
Im using reinforcemet designer to train my model,
and here is my problem.
Q. I dont not why my reward cannot go up to 0.1, why is this happen?? How can I fix this??


0 commentaires
Réponses (1)
Harsh
le 28 Fév 2025
Hi @기범
In Reinforcement Learning (RL), the reward is a signal that guides the agent’s learning by providing feedback on its actions. It changes dynamically based on the agent’s behavior and the environment’s response. A well-designed reward function encourages desired actions and discourages unwanted ones, leading to improved performance over time.
You can try the following to improve your performance:
For more information on how to craft a reward function please refer to the following MATLAB tech talks webinar - https://www.mathworks.com/videos/reinforcement-learning-part-2-understanding-the-environment-and-rewards-1551976590603.html
0 commentaires
Voir également
Catégories
En savoir plus sur Environments dans Help Center et File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!