Hacker News new | ask | show | jobs
by sweezyjeezy 1021 days ago
That's probably more because of RLHF though, they've optimised for certain kind of responses rather than simple model loss on internet text.