


	by BoorishBears 359 days ago
	OK, you're off in the land of "what if," and I can just flat out say: if you have a ZDR (zero data retention) account, there is no filtering on inference, no real-time moderation, and no blocking. If you use their training infrastructure, there's moderation on training examples, but SFT (supervised fine-tuning) on non-harmful tasks still leads to a complete breakdown of guardrails very quickly.