|
|
|
|
|
by rdedev
1133 days ago
|
|
It's pretty straightforward to build an RL environment for closed systems like chess but I don't think it's close enough for an AGI to learn. Like RLHF uses human feedback. Unless we come up with a way to scale that process AGI by this year doesn't seem possible |
|