Réponse apportée
reinforcement learning PMSM-code
You can find the example here.

presque 5 ans il y a | 0

| A accepté

Réponse apportée
How to deal with a large number of state and action spaces?
Even if the NX3 inputs are scalars, I would reorganize them into an "image" and use imageInput layer for the first layer as oppo...

presque 5 ans il y a | 0

Réponse apportée
Q learning algorithm in image processing using matlab.
Hello, Finding an example that exactly matches what you need to do may be challenging. If you are looking for the "deep learnin...

presque 5 ans il y a | 0

| A accepté

Réponse apportée
Need help with Model based RL
Hello, If you want to use the existing C code to train with Reinforcement Learning Toolbox, I would use the C caller block to b...

presque 5 ans il y a | 1

| A accepté

Réponse apportée
How to set the reinforcement learning block in Simulink to output 9 actions
Hello, the example you are referring to does not output 3 values for the pid gains. The PID gains are "integrated" into the neu...

presque 5 ans il y a | 0

Réponse apportée
Where to update actions in environment?
Reinforcement Learning Toolbox agents expect a static action space, so fixed number of options at each time step. To create a dy...

presque 5 ans il y a | 0

Réponse apportée
How to check the weight and bias which taked by getLearnableParameters?
Can you provide some more details? What does 'wrong answer' mean? How do you know the weights you are seeing are not correct? Ar...

presque 5 ans il y a | 0

Réponse apportée
Gradient in RL DDPG Agent
If you put a break point right before 'gradient' is called in this example, you can step in and see the function implementation....

presque 5 ans il y a | 0

| A accepté

Réponse apportée
Soft Actor Critic deploy mean path only
Hello, Please take a look at this option here which was added in R2021a to allow exactly the behavior you mentioned. Hope this...

presque 5 ans il y a | 0

| A accepté

Réponse apportée
How to pretrain a stochastic actor network for PPO training?
Hello, Since you already have a dataset, you will have to use Deep Learning Toolbox to get your initial policy. Take a look at ...

presque 5 ans il y a | 1

Réponse apportée
Failure in training of Reinforcement Learning Reinforcement Learning Onramp
Hello, We are aware and working to fix this issue. In the meantime, can you take a look at the following answere? https://www....

presque 5 ans il y a | 0

Réponse apportée
DQN Agent with 512 discrete actions not learning
I would initially revisit the critic architecture for 2 reasons: 1) Network seems a little simple for a 3->512 mapping 2) This...

presque 5 ans il y a | 0

Réponse apportée
How does the Q-Learning update the qTable by using the reinforcement learning toolbox?
Can you try critic.Options.L2RegularizationFactor=0; This parameter is nonzero by default and likely the reason for the discre...

presque 5 ans il y a | 0

Réponse apportée
File size of saved reinforcement learning agents
Hello, Is this parameter set to true? If yes, then it makes sense that mat files are growing in size as the buffer is being pop...

presque 5 ans il y a | 0

| A accepté

Réponse apportée
Saving Trained RL Agent after Training
Setting the IsDone flag to 1 does not erase the trained agent - it actually makes sense that the sim was not showing anything be...

presque 5 ans il y a | 0

| A accepté

Réponse apportée
How to Train Multiple Reinforcement Learning Agents In Basic Grid World? (Multiple Agents)
Training multiple agents simultaneously is currently only supported in Simulink. The predefined Grid World environments in Reinf...

presque 5 ans il y a | 1

| A accepté

Réponse apportée
How to create a neural network for Multiple Agent with discrete and continuous action?
If you want to specify the neural network structures yourself, there is nothing specific you need to do - simply create two acto...

presque 5 ans il y a | 0

| A accepté

Réponse apportée
Is it possible apply Reinfocrement Learning to classify data?
If you already have a labeled dataset, supervised learning is the way to go. Reinforcement learning is more for cases where data...

presque 5 ans il y a | 0

| A accepté

Réponse apportée
Combining two deep neural networks to train simultaneously
Hello, You can do this in Simulink - see the following examples for reference. https://www.mathworks.com/help/reinforcement-l...

presque 5 ans il y a | 1

| A accepté

Réponse apportée
DQN learns at first but then worsens.
To confirm that this is an exploration issue, can you try setting the EpsilonMin param to a high value? e.g. 0.99. If after doin...

presque 5 ans il y a | 0

Réponse apportée
How to resume train a trained agent?about Q learning agents.
Hello, To see how to iew the table values, take a look at the answer here. Also, you don't have to do anything specific to con...

presque 5 ans il y a | 1

| A accepté

Réponse apportée
Reinforcement learning action getting saturated at one range of values
Your scaling layer is not set up correctly. You want to scale to (upper limit-lower limit)/2 and then shift accordingly. scali...

presque 5 ans il y a | 1

| A accepté

Réponse apportée
How can I provide constraints to the actions provided by the Reinforcement Learning Agent?
Hard constraints are not typically supported during training in RL. You can specify limits/constraints as you mention above, but...

presque 5 ans il y a | 0

| A accepté

Réponse apportée
Exporting data only works as pdf. Axis labels are getting small and unreadable
You cannot save as .fig from the episode manager plot. If you have the training data though (it's good practice to save this dat...

presque 5 ans il y a | 1

| A accepté

Réponse apportée
Reinforcement Learning multiple agent validation: Can I have a Simulink model host TWO agents and test them
That should be possible. Did you follow the multi-agent examples? Since the agents are trained already you may want to check the...

presque 5 ans il y a | 0

| A accepté

Réponse apportée
Do the actorNet and criticNet share the parameter if the layers have the same name?
No, each network has its own parameters. Shared layers are not supported out of the box, you would have to implement custom trai...

presque 5 ans il y a | 0

| A accepté

Réponse apportée
Any RL Toolbox A3C example?
Hello, To get an idea of what an actor/critic architecture may look like, you can use the 'default agent' feature that creates ...

presque 5 ans il y a | 1

| A accepté

Réponse apportée
After training my DDPG RL agent and saving it, unexpected simulation output
See answer here

presque 5 ans il y a | 0

| A accepté

Réponse apportée
Saved agent always gives constant output no matter how or how much I train it
The problem formulation is not correct. I suspect that even during training, you are seeing a lot of bang bang actions. The bigg...

presque 5 ans il y a | 1

| A accepté

Réponse apportée
How can I create a Reinforcement Learning Agent representation based on Recurrent neural network (RNN, LSTM, among others)
Hello, Which release are you using? R2020a and R2020b support LSTM policies for PPO and DQN agents. Starting in R2021a you can ...

presque 5 ans il y a | 2

| A accepté

Charger plus