by Oxidation
1257 days ago
> Why not make the function that outputs answers also feed itself "is this actually right/are you sure/is this not wrong"? Too expensive? Giant loop?

They do this, with humans: both during training (which uses supervised and reinforcement learning) and now at a much greater scale. That is what the free public access period is for, and why there's a thumbs up/down button next to the output.