Hacker News
by Oxidation 1257 days ago
> Why not make the function that outputs answers also feed itself "is this actually right/are you sure/is this not wrong"? Too expensive? Giant loop?

They do this, with humans, both during training (they use supervised and reinforcement learning) and now at a much greater scale: it's what the free public access period is for, and why there's a thumbs up/down button next to the output.