|
|
|
|
|
by andreyk
2903 days ago
|
|
All that stuff is in part two! https://thegradient.pub/how-to-fix-rl/ Says as much at the end... to be fair we did warn up front "The first part, which you're reading right now, will set up what RL is and why it is fundamentally flawed. It will contain some explanation that can be skipped by AI practitioners." But personally I think the board game allegory is fun and that most people tend to forget the categorical simplicity of Go and Atari games and overhype ; easy to say the main points are not new but the details are important here. |
|