Reinforcement Learning based quadrotor control using Soft Actor-Critic (the reward is not converging)

5 vues (au cours des 30 derniers jours)

Unmanned Aerial and Space Systems le 30 Avr 2022

0
Lien

Utiliser le lien direct vers cette question

https://fr.mathworks.com/matlabcentral/answers/1708930-reinforcement-learning-based-quadrotor-control-using-soft-actor-critic-the-reward-is-not-converging

Modifié(e) : Unmanned Aerial and Space Systems le 3 Août 2025

Hi, I am trying to control of a rotary wing UAV (quadrotor) by using Soft-Actor Critic methodology, but I have some problems, my reward is increasing continously after the point you see following image, what is the main problem, can you advice for this situation, I am sharing my files (Simulink and m-file). My max reward values should be zero as we define in reward function on Simulink file. This reward function indicates that the difference between desired trajectory and actual trajectory is about zero.