Is it possible to change the loss function used in deep reinforcement learning (DQN) to the Huber function?

% Build the DQN agent
criticNetwork=[
    featureInputLayer(7,'Normalization','none','Name','state')
    fullyConnectedLayer(64,'Name','fc1')
    reluLayer('Name','relu1')
    fullyConnectedLayer(64,'Name','fc2')
    reluLayer('Name','relu2')
    fullyConnectedLayer(64,'Name','fc3')
    reluLayer('Name','relu3')
    fullyConnectedLayer(5,'Name','action')];
criticOpts = rlRepresentationOptions('LearnRate',0.001,'Optimizer',"rmsprop");
critic = rlQValueRepresentation(criticNetwork,obsInfo,actInfo,'Observation',{'state'},'Action',{'action'},criticOpts);
agentOptions = rlDQNAgentOptions(...
    'SampleTime',Ts,...
    'TargetSmoothFactor',1,...
    'TargetUpdateFrequency',2,...
    'ExperienceBufferLength',50000,...
    'ResetExperienceBufferBeforeTraining',false,...
    'SaveExperienceBufferWithAgent',true,...
    'NumStepsToLookAhead',5,...
    'UseDoubleDQN',true,... 
    'MiniBatchSize',32,...
    'DiscountFactor',0.99);
% Epsilon-greedy exploration: start fully random and decay toward EpsilonMin
agentOptions.EpsilonGreedyExploration.Epsilon = 1;
agentOptions.EpsilonGreedyExploration.EpsilonDecay = 0.0097566;
agentOptions.EpsilonGreedyExploration.EpsilonMin = 0.02;
agent = rlDQNAgent(critic,agentOptions);

1 Comment

一馬平田 on 9 Sep 2021

Sorry, I solved this myself.
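
The thread does not record how this was resolved. For reference, here is a minimal sketch of the Huber loss itself in plain MATLAB, applied to predicted and target Q-values. The names huberElem, huberLoss, and delta (the transition point between the quadratic and linear regions) are illustrative assumptions, not part of Reinforcement Learning Toolbox, and the sketch does not show how to wire the loss into rlDQNAgent.

```matlab
% Elementwise Huber penalty for an error e with transition point delta.
% (huberElem, huberLoss, and delta are hypothetical names for illustration.)
huberElem = @(e, delta) (abs(e) <= delta) .* (0.5 * e.^2) ...
                      + (abs(e) >  delta) .* (delta * (abs(e) - 0.5 * delta));

% Mean Huber loss over all elements of a mini-batch of Q-values.
huberLoss = @(qPred, qTarget, delta) mean(huberElem(qPred - qTarget, delta), 'all');

% Example: errors are 0, 0, and -3, so the penalties are 0, 0, and 2.5.
qPred   = [1 2 3];
qTarget = [1 2 6];
loss = huberLoss(qPred, qTarget, 1);
fprintf('%.4f\n', loss);   % prints 0.8333
```

With delta = 1, errors smaller than 1 in magnitude are penalized quadratically, as in mean squared error, while larger errors grow only linearly, which limits the influence of outlier TD errors.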

Answers (0)

Release

R2021a
