Hacker News
by sashank_1509 1143 days ago
They used to offer exactly this for fine-tuning models, but never offered it after ChatGPT. I think the difficulty comes with fine-tuning RLHF models: it's not obvious how to do this correctly.