| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pamelafox 355 days ago
	I just ran evaluations of gpt-5 for our RAG scenario and was pleasantly surprised at how often it admitted “ I don’t know” - more than any model I’ve eval’d before. Our prompt does tell it to say it doesnt know if context is missing, so that likely helped, but this is the first model to really adhere to that.