| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pixl97 430 days ago
	>LLM’s etc can’t do that under current methodology I hate to give a smarmy result, but are you sure you know what RLHF is? Because this is one way to correct said data.

1 comments

I am aware of RLHF, and no it doesn’t solve this problem.

There’s a great deal of lesions to be learned from X PB of training data that wouldn’t be covered.