Hacker News
by
astrange
853 days ago
The fine-tuning process isn't itself a statistical model, so that principle doesn't apply to it. You beat the model into shape until it does what you want (DPO and its variants), and you can test that it's doing that.
AnarchismIsCool
853 days ago
Yeah, but you're still beating up a statistical model that's gonna do statistical things.
Also, we're talking about prompt engineering more than fine-tuning.