Hi everyone, hope all is well at your end.
I need to design a reinforcement learning (RL) based controller as an asignment to stabilize a quad copter from an initial position, orientation and angular velocity. The way I look at the problem is that due to continuous action space i intend to apply a policy gradient actor critic algorithm of RL. Request guide on some Matlab code that utilises policy gradient actor critic algorithm of RL that would help me solve this problem.
Although i am developing a simple quad copter model but if anyone can help me with an existing model that will validate my case ill be grateful for that as well.
Thanking in anticipation...