Hacker News new | ask | show | jobs
by make3 254 days ago
isn't that just instruction fine tuning and rlhf inducing style & deference? why is that surprising