by hb-robo
516 days ago
Layman question here since this isn't my field: how do you achieve success on closed-system tasks without supervision? Surely at some point along the way, the system must understand whether its answers and reasoning are correct.
Basically, they have an external source of truth that verifies whether the model's answers are correct.
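
To make that concrete, here's a rough sketch of what such a check could look like. This is only an illustration, not anyone's actual pipeline: the `verify_answer` function, the toy problems, and the hard-coded `sampled_answers` are all made up for the example, and a real setup would sample answers from the model and might verify them by running unit tests or a proof checker rather than comparing strings.

```python
def verify_answer(model_answer: str, reference: str) -> float:
    """Hypothetical verifier: 1.0 if the answer matches the known-correct one, else 0.0."""
    return 1.0 if model_answer.strip() == reference.strip() else 0.0


# Toy problems whose correct answers are known in advance
# (this plays the role of the external source of truth).
problems = [
    {"prompt": "What is 17 * 3?", "reference": "51"},
    {"prompt": "What is 2 ** 10?", "reference": "1024"},
]

# Stand-in for answers sampled from the model (assumed values for illustration).
sampled_answers = ["51", "1000"]

# One reward per answer; no human labels the reasoning, only the result is checked.
rewards = [
    verify_answer(answer, problem["reference"])
    for answer, problem in zip(sampled_answers, problems)
]
print(rewards)  # [1.0, 0.0]
```

The point is that the signal comes from checking the final result automatically, so nobody has to supervise the intermediate reasoning by hand.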