I wonder how difficult it could be to bring Reinforcement Learning support into the library? Here is a small discussion. I think that supervised methods should work well with modern RL algorithms, like, DQN and playback buffers. So probably optimal strategies learning could somehow benefit from features implemented in the library.
1 Like