Hacker News
by pixl97 430 days ago
>LLM’s etc can’t do that under current methodology

I hate to give a smarmy reply, but are you sure you know what RLHF is? Because this is one way to correct said data.

1 comment

I am aware of RLHF, and no, it doesn’t solve this problem.

There’s a great deal of lessons to be learned from X PB of training data that wouldn’t be covered.