| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jiito 224 days ago
	I haven't read this particular paper in-depth, but it reminds me of another one I saw that used a similar approach to find if the model encodes its own certainty of answering correctly. https://arxiv.org/abs/2509.10625