by thomasskis
952 days ago
How about using an LLM to help you write a MuZero-like model designed for a specific task? (Also, MuZero took something like 12 hours to train on old hardware, so my MacBook might be good enough here.) Obviously we’re not there yet, but it doesn’t seem far away. Hell, you could train a small LLM specifically to do this.