	by Lucent 754 days ago
	This is an incredible relief and should be the final nail in the coffin for safety/alignment/shoggoth arguments. It turns out features are completely scrutable, and when they are modified, we don't see chaotic, schizo non-sequiturs but a coherent, predictable, globally consistent shift, proving that models operate in a fundamentally understandable way.