| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ausbah 343 days ago
	yeah like another commenter said, if you can get synthetic data with some some sort of easily verifiable grounding (math, games, code) models can do very well. this is one of the underpinnings of reinforcement learning that has helped some advancements in past year or so (AFAIK)