Hacker News new | ask | show | jobs
by mememememememo 78 days ago
Not sure it is true LLMs don't see code or cli commands directly in their training. They go through reinforcement learning and they could easily be trained on a command line. People are paid to give human feedback. See https://huyenchip.com/2023/05/02/rlhf.html