Hacker News new | ask | show | jobs
by sesm 77 days ago
Don't we already have "RLHF on synthetic data"?