| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ThrowawayR2 52 days ago
	> "There is probably a whole testing workflow at AI companies to tweak each new model until it "looks" acceptable." Isn't that what the RLHF phase does ( https://www.paloaltonetworks.com/cyberpedia/what-is-rlhf )?