Hacker News
by noname120 327 days ago
RLHF wasn't needed for DeepSeek, only gobbling up the whole internet, both the good and the bad stuff. See their paper.