| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Oxidation 1257 days ago
	> Why not do make the function that outputs answers also feed itself "is this actually right/are you sure/is this not wrong"? Too expensive? Giant loop? They do this, with humans. Both during training (they use supervised and reinforcement learning), and now at a much greater scale: it's what the free public access period is for and why there's a thumbs up/down button next to the output.