by sashank_1509 1143 days ago
They used to offer exactly this for fine-tuning models, but never offered it after ChatGPT. I think the difficulty comes with fine-tuning RLHF models; it's not obvious how to do this correctly.