Hacker News new | ask | show | jobs
by Retric 432 days ago
I am aware of RLHF, and no it doesn’t solve this problem.

There’s a great deal of lesions to be learned from X PB of training data that wouldn’t be covered.