| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mememememememo 125 days ago
	Not sure it is true LLMs don't see code or cli commands directly in their training. They go through reinforcement learning and they could easily be trained on a command line. People are paid to give human feedback. See https://huyenchip.com/2023/05/02/rlhf.html