Do you have any tutorials or examples with regards to time series data and reinforcement learning? It seems very promising but I can't find any examples at all and I am at a loss as to how to implement this in Matlab, especially when creating a new environment.
A simple example would just be a simulation of stock with 1 year data of let's say 1 minute historic open, close, low, high, volume values. The actions would be buy (1), hold (0) and sell (-1). The environment output would assumably be the open, low, volume of that particular time (t). There would also be some indication that the stock has been previously purchased, or sold in there. Seems relatively simple but I think I'm just missing how to get the data into the environment and then setting it up.
Thanks for checking this out.