Problems creating an interactive Simulink reinforcement learning environment with time-series observations
I want to set up an online learning environment for PPO in Simulink. The state (observation) input is a 2*100 time series, and I would like to ask how to implement this. When I define the observation specification with observationInfo = rlNumericSpec([2 100]); and then create the actor, I get the following error:
Error using rl.internal.validate.mapFunctionObservationInput (line 50)
Model input sizes must match the dimensions specified in the corresponding observation and action info specifications.
Error in rlDiscreteCategoricalActor (line 86)
model = rl.internal.validate.mapFunctionObservationInput(model,...
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Error in rl_demo (line 80)
actor = rlDiscreteCategoricalActor(actorNetwork,observationInfo,actionInfo);
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
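The error message says that the input size of actorNetwork does not match the dimensions in observationInfo. One possible way to make them agree is sketched below. It is only a sketch under stated assumptions, not the code from rl_demo: it assumes the 2*100 window is flattened to a 200-by-1 vector before it reaches the agent (for example with a Reshape block in Simulink), and the action set and hidden layer size are hypothetical placeholders.

```matlab
% Sketch: keep the network input size consistent with the observation spec.
% Assumption: the 2*100 window is flattened to a 200x1 vector before it
% reaches the RL Agent block (e.g. with a Reshape block in Simulink).
obsDim = [200 1];
observationInfo = rlNumericSpec(obsDim);
observationInfo.Name = "observations";

% Hypothetical discrete action set; replace it with the real one.
actionInfo = rlFiniteSetSpec([1 2 3]);
numActions = numel(actionInfo.Elements);

% For a feature input, the layer size must equal prod(obsDim).
% The hidden layer size (64) is an arbitrary placeholder.
actorNetwork = [
    featureInputLayer(prod(obsDim))
    fullyConnectedLayer(64)
    reluLayer
    fullyConnectedLayer(numActions)
    ];
actorNetwork = dlnetwork(actorNetwork);

% The input layer (200 features) now matches observationInfo ([200 1]).
actor = rlDiscreteCategoricalActor(actorNetwork,observationInfo,actionInfo);

% Quick check with a random observation of the declared size.
act = getAction(actor,{rand(obsDim)});
```

If the temporal structure of the window matters, a recurrent network may be an alternative: the observation spec would then describe a single time step (rlNumericSpec([2 1])) and the network would start with sequenceInputLayer(2) followed by at least one lstmLayer. Whether that fits depends on how the Simulink model delivers the data.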