| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by davidlee1435 2603 days ago
	I wonder if you could make an RL version of a GPT-2 model specifically optimized for code, where you try to compile the output and penalize whenever there is an exception/error that's thrown.