Red-Portal@alien.top to Machine Learning@academy.garden • [R] Rethinking OpenAI's Q-Learning: Insights from the Award-Winning 'Non-delusional Q-learning' Paper
1 year ago
We don’t even know whether it’s actually an RL approach lol
This sounds like a classic-as-in-textbooks example of a system identification problem in control. Have you taken a look at this setting and how people solve it?
Not only that, the numerical computing and high-performance computing landscape in Rust is very primitive.