Problems in reinforcement learning training
2 vues (au cours des 30 derniers jours)
Afficher commentaires plus anciens
The effect of matlab reinforcement learning in the training process is better, but the reason for the poor effect after saving the agent is, or how to save the good effect in the training process
3 commentaires
Shantanu Dixit
le 2 Sep 2024
Assuming you're experiencing different training process before and after loading the saved agent, this could be due to following factors:
- Experience Buffer: By default, the experience buffer isn't saved with some agents like DDPG and DQN. If you plan to continue training the saved agent, consider setting 'SaveExperienceBufferWithAgent' to true to preserve the experience buffer.
- Non-Determinism and Exploration Strategy: Training may involve stochastic elements, causing the agent to explore different trajectories after being reloaded, which could result in a different training process.
Additionaly you can refer to 'SaveAgentCriteria' and 'SaveAgentValue' to save agents that meet specific performance criteria.
Refer to the below MathWorks documentation for different saving strategies:
Réponses (0)
Voir également
Catégories
En savoir plus sur Training and Simulation dans Help Center et File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!