Hacker News new | ask | show | jobs
by radarsat1 557 days ago
Hi, the Zephyr link may be what I'm looking for. yeah I'm quite familiar with RL already so it was specifically RLHF that I was asking about, I'll check out that resource, thanks!