Answered
Reinforcement learning deployment in real-time system
Hello, To generate code from a trained policy, you should follow the process shown here. Note that this is a MATLAB-based workf...

more than 5 years ago | 0

| Accepted

Answered
Implementation of Proximal Policy Optimisation
Hello, It seems you want to use PPO with continuous action space. If that's the case, your actor network does not have the righ...

more than 5 years ago | 1

| Accepted

Answered
DDPG agent has saturated actions with diverging Q value
For the actor switching between extreme actions, please refer to this answer - it sounds relevant. In short, make sure you include a...

more than 5 years ago | 0

Answered
Can I use NN built with Fitnet for Reinforcement Learning toolbox with DQN agent?
Hi Abhay, Reinforcement Learning Toolbox currently supports the layers supported by Deep Learning Toolbox only. You could try c...

more than 5 years ago | 0

| Accepted

Answered
how I can connect Agent's action to set block parameter?
Hello, The most straightforward way is if the block accepts external input that modifies these parameters (for example like the...

more than 5 years ago | 0

| Accepted

Answered
PPO agent applied to ACC model
Hello, Can you make sure that you set up your actor following a structure similar to this one? It seems that your variance path...

more than 5 years ago | 0

Answered
Problems to set up the reset function in Reinforcement learning environment
Maybe I am missing something, but why don't you add a couple of lines that call 'setBlockParameter' with the appropriate path to...
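As an illustration, a minimal sketch of such a reset function (the model path, block name, and parameter values below are hypothetical placeholders, not from the original question):

```matlab
% Hypothetical reset function: randomizes a block parameter at the start
% of each training episode. 'in' is the Simulink.SimulationInput object
% that the environment passes in before every simulation.
function in = localResetFcn(in)
    h0 = 0.5 + rand;  % draw a new random initial value for this episode
    in = setBlockParameter(in, 'myModel/Integrator', ...
        'InitialCondition', num2str(h0));
end
```

You would then register it on the environment with `env.ResetFcn = @localResetFcn;`.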

more than 5 years ago | 0

Answered
Easy way to evaluate / compare the performance of RL algorithm
Why not use a MATLAB Fcn block and implement the dummy agent in there? If you want random/constant actions, it should be just one li...
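For instance, such a dummy agent inside a MATLAB Function block could be as simple as this sketch (an action range of [-1, 1] is assumed here):

```matlab
% Hypothetical MATLAB Function block body: a "dummy agent" that ignores
% the observation and outputs a uniformly random action in [-1, 1].
function action = dummyAgent(observation) %#ok<INUSD>
    action = 2*rand - 1;  % or return a constant, e.g. action = 0.5;
end
```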

more than 5 years ago | 1

Answered
Is it possible to train LSTM Network without a Dataset?
In the paper they mention "Although a readily available dataset is required to train an LSTM network, we devised an efficient wa...

more than 5 years ago | 1

| Accepted

Answered
Reinforcement learning: "NextObs" vs. "LoggedState" in step function
Actually, NextObs is the important thing here. It represents the value of your states after you apply current action and integra...

more than 5 years ago | 0

Answered
What's the purpose of adding a transfer function after a Integrator block?
Hello, It is likely there to filter out high-frequency content. Hope that helps.

more than 5 years ago | 0

Answered
PPO agent with continuous action example
Hello, If you want to use PPO, i.e. a stochastic actor with continuous action space, you can follow the structure shown here.
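A rough sketch of that actor structure (layer names and sizes are placeholders; the point is the shared input path splitting into a mean path and a standard-deviation path kept positive by a softplus):

```matlab
% Hypothetical Gaussian actor network for continuous-action PPO:
% a common feature path that branches into mean and std-dev outputs.
obsPath = [imageInputLayer([4 1 1],'Normalization','none','Name','obs')
           fullyConnectedLayer(64,'Name','fc')
           reluLayer('Name','relu')];
meanPath = fullyConnectedLayer(1,'Name','mean');      % action mean
stdPath  = [fullyConnectedLayer(1,'Name','fcStd')
            softplusLayer('Name','std')];             % keeps std > 0

net = layerGraph(obsPath);
net = addLayers(net, meanPath);
net = addLayers(net, stdPath);
net = connectLayers(net,'relu','mean');
net = connectLayers(net,'relu','fcStd');
```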

more than 5 years ago | 0

Answered
Environment for Reinforcement Learning Project
Hello, We are working on providing an interface between OpenAI Gym and Reinforcement Learning Toolbox but this will take some m...

more than 5 years ago | 1

| Accepted

Answered
How do I properly substitute rlRepresentation with rlValueRepresentation, rlQValueRepresentation, rlDeterministicActorRepresentation, and rlStochasticActorRepresentation?
It would be helpful if you pasted the exact MATLAB code you are typing to see what the problem is. I suspect you simply changed ...
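For reference, the substitution typically looks like the following sketch (the network, specs, and options variables are placeholders; pick the representation class that matches your approximator):

```matlab
% Pre-R2020a (deprecated generic constructor):
% critic = rlRepresentation(criticNet, obsInfo, actInfo, ...
%     'Observation', {'state'}, 'Action', {'action'}, criticOpts);

% R2020a onwards - use the specific representation class, e.g. for a
% Q-value critic that takes both observation and action as inputs:
critic = rlQValueRepresentation(criticNet, obsInfo, actInfo, ...
    'Observation', {'state'}, 'Action', {'action'}, criticOpts);
```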

more than 5 years ago | 0

Answered
Deep Q Learning - define an adaptive critic learning rate?
Hi Niklas, I believe this is currently not supported. This is an interesting use case though - I will inform the development tea...

more than 5 years ago | 0

| Accepted

Answered
Build Environment reinforcement learning
Hello, For Simulink environments, the following page should be helpful: https://www.mathworks.com/help/reinforcement-learning/...

more than 5 years ago | 0

Answered
Using Reinforcement Learning algorithm to optimize parameter(s) of a controller
Hi Hazwan, The main difference between using RL for control vs parameter tuning is that in the first case the policy will direc...

more than 5 years ago | 1

| Accepted

Answered
Initializing pimp-controller failed: Error binding to tcp://*: no free port in range 9620-9620
Hello, I would contact technical support for this, and show them how to reproduce the error. If the issue is in the communicati...

more than 5 years ago | 0

Answered
Can LoggedSignal in provided Link contain more than just the state?
LoggedSignals is not tied to the state or the observations, so you should be able to store whatever makes sense to you in that v...

more than 5 years ago | 0

| Accepted

Answered
Using getValue in matlab fcn block in simulink
Hi Sam, Before R2020a, the easiest way to bring the critic in Simulink without using the Agent block is to call generatePolicy...

more than 5 years ago | 1

Answered
Multi action agent programming in reinforcement learning
This example shows how to create an environment with multiple discrete actions. Hope that helps
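As a sketch, a finite action space with multiple simultaneous discrete actions can be specified by enumerating the joint combinations (two actuators with values -1/0/1 are assumed here purely for illustration):

```matlab
% Hypothetical action spec: each cell element is one joint action [a1; a2],
% so the agent picks among 9 discrete action vectors.
actInfo = rlFiniteSetSpec({ ...
    [-1;-1], [-1;0], [-1;1], ...
    [ 0;-1], [ 0;0], [ 0;1], ...
    [ 1;-1], [ 1;0], [ 1;1]});
```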

more than 5 years ago | 0

Answered
Incorporate Time into Reinforcement Learning Environment
Time would be another parameter of your environment. Interactions between the agent and environment happen at discrete time step...

more than 5 years ago | 1

| Accepted

Answered
How to view the output of rlNumericSpec?
Hi Jacob, I think what you want to do is take the output of the agent and do the transformation you mention (not the output of ...

more than 5 years ago | 0

Answered
Create and Train DQN Agent with just a State Path and Not Action Path
Hello, This page shows how this can be done in R2020a. We will have examples that show this workflow in the next release. Hope th...

more than 5 years ago | 1

| Accepted

Answered
To choose an action, is it correct to compute the value of successor state or do we need to compute value of states in the entire path till end state?
Hi Gowri, Using the Q value for a state+action pair encodes all the information till 'the end of the path' weighted by a discou...

more than 5 years ago | 1

| Accepted

Answered
Agent repeats same sequence of actions each episode
Hi Braydon, I am not really sure why you are only looking at the first two episodes. RL can take thousands of episodes to conve...

more than 5 years ago | 0

| Accepted

Answered
How to create an custom Reinforcement Learning Environment + DDPG agent
Hi Kai, What the very first error is telling you is that there is an issue with the dimensions of either your observation, rewa...

more than 5 years ago | 1

| Accepted

Answered
Reinforcement Learning Sample Time
Hi Braydon, The agent sample time effectively determines how often the agent will output a decision/action. Think of it as the...

more than 5 years ago | 0

Answered
2 out of 7 Observations Defined in MATLAB DDPG Reinforcement Learning Environment. Are the rest given random values?
Hello, I am assuming you are referring to the initialization of x and y inside the "flyingRobotResetFcn" function. Basically, i...

more than 5 years ago | 0

| Accepted

Answered
RL Agent does not learn
Hello, It is really hard to say just by looking at the training plot. The first thing I would try is 1) a different agent (mayb...

more than 5 years ago | 0

| Accepted