| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by shanjai_raj7 106 days ago
	the part that is hard is when the model gets updated and your prompts behave differently. we dont always catch it in tests because the output still looks correct, just slightly off. by the time you notice something is wrong it has already been like that for a while.