by shanjai_raj7 106 days ago
The hard part is when the model gets updated and your prompts start behaving differently. We don't always catch it in tests because the output still looks correct, just slightly off. By the time you notice something is wrong, it has already been that way for a while.
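One rough way to catch this kind of quiet drift is to keep a stored baseline output for each prompt and flag any prompt whose new output has moved too far from it. This is only a sketch: `drift_score`, `find_drifted`, and the 0.1 threshold are made-up names and values for illustration, and plain text similarity is a crude stand-in for whatever "slightly off" means for your outputs.

```python
from difflib import SequenceMatcher


def drift_score(baseline: str, current: str) -> float:
    """Return 0.0 for identical text, approaching 1.0 as the texts diverge."""
    return 1.0 - SequenceMatcher(None, baseline, current).ratio()


def find_drifted(
    baselines: dict[str, str],
    current: dict[str, str],
    threshold: float = 0.1,  # assumed value; tune it on your own outputs
) -> list[str]:
    """Return the names of prompts whose output drifted past the threshold.

    A prompt missing from `current` is compared against an empty string,
    so it is reported as drifted unless its baseline was also empty.
    """
    return sorted(
        name
        for name, old in baselines.items()
        if drift_score(old, current.get(name, "")) > threshold
    )
```

Run it on a schedule against the live model rather than only in CI, so a silent model update shows up as a list of drifted prompt names instead of something you notice weeks later.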