|
|
|
|
|
by rvz
472 days ago
|
|
Note: What makes this interesting is that this is a pre-LLM project which shows that in some projects you don't need an "LLM" for this. All you need is just a plain old reinforcement learning algorithm and a deep neural network which is perfect for this. This is what I want to see more of and goes against the hype of LLMs. What a great RL project. Meanwhile, "Claude" is still stuck somewhere in the game. Imagine the costs of running that vs this project. |
|