DDPG Agent OU noise options to favour exploration

3 vues (au cours des 30 derniers jours)

Abd Al-Rahman Al-Remal le 22 Juil 2021

0
Lien

Utiliser le lien direct vers cette question

https://fr.mathworks.com/matlabcentral/answers/883684-ddpg-agent-ou-noise-options-to-favour-exploration

Modifié(e) : Abd Al-Rahman Al-Remal le 22 Juil 2021

Hi there,

I have seem similar posts here however I haven't found one that explains how to actually tune the OU noise parameters to favour exploration - currently my agent is stuck on the same reward value from the beginning and does not change/train/learn.

Can anyone advise on how to tune the OU noise parameters within the code to favour exploration? Currently mine are:

agentOpts.NoiseOptions.StandardDeviation = 0.3;

agentOpts.NoiseOptions.StandardDeviationDecayRate = 1e-5;

agentOpts.NoiseOptions.MeanAttractionConstant = 2e-3;

This worked for a previous similar model I made however I understand that the parameter smust be modified per model however I don't know how and literature all looks very dense and doesn't give a clear answer.

Thanks in advance!

Abd