| HN Mirror



	by logicchains 102 days ago
	>What fundamental limitation of these large language models makes them unable to produce natural language?

	LLMs can get indefinitely good at coding problems by training in a reinforcement learning loop on randomly generated coding problems with compiler/unit tests to verify correctness. On the other hand, there's no way to automatically generate a "human thinks this looks like slop" signal; it fundamentally requires human time, severely limiting throughput compared to fully automatable training signals.
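
To make the "fully automatable training signal" point concrete, here is a rough sketch of what such a verifier could look like. Everything in it is an assumption for illustration: `make_problem` and `reward` are hypothetical names, the toy task ("add a constant to x") stands in for a real problem generator, and a real pipeline would run candidate code in a sandbox rather than with a bare `exec`.

```python
import random


def make_problem(rng):
    """Hypothetical generator: a toy 'return x + k' task with its own unit tests."""
    k = rng.randint(1, 9)
    tests = [(x, x + k) for x in range(5)]
    return {"prompt": f"Write f(x) that returns x + {k}.", "k": k, "tests": tests}


def reward(source, tests):
    """Score candidate source code as the fraction of unit tests it passes.

    No human is involved: the tests alone decide the score.
    """
    namespace = {}
    try:
        # Assumption: trusted toy input. Real systems sandbox this step.
        exec(source, namespace)
        f = namespace["f"]
    except Exception:
        return 0.0  # does not compile, or does not define f
    passed = 0
    for x, expected in tests:
        try:
            if f(x) == expected:
                passed += 1
        except Exception:
            pass  # a crash on one test counts as a failure for that test
    return passed / len(tests)


rng = random.Random(0)
problem = make_problem(rng)

# Stand-ins for model outputs: one correct candidate, one wrong one.
good = f"def f(x):\n    return x + {problem['k']}"
bad = "def f(x):\n    return x"

print(reward(good, problem["tests"]), reward(bad, problem["tests"]))  # prints "1.0 0.0"
```

The loop can call `make_problem` and `reward` as many times as compute allows. There is no equivalent function for "does this read like slop to a person", which is the throughput gap the comment is describing.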