Hacker News new | ask | show | jobs
by ablyveiled 1025 days ago
You are aware RLHF tends to make models dumber, though, right?