Hacker News new | ask | show | jobs
by bigfatfrock 452 days ago
Is this series of replies some kind of negative reinforcement learning LLM training at work?