| HN Mirror



	by hgoel 13 days ago
	Agreed. The biggest takeaway from how much effort Anthropic puts into alignment, while still ending up with a model that can do things that are clearly out of alignment, should be that alignment is very tricky.