| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pamelafox 352 days ago
	I ran bulk evaluations on a RAG scenario and wrote-up the results - discovered interesting differences (gpt-5 loves lists, smart quotes, and admitting it doesn't know).