Hacker News new | ask | show | jobs
by ChrisCinelli 1031 days ago
I wonder why is much more expensive.
1 comments

They would need to store and load the model, even if I imagine that they are using something similar to LoRA to finetune their models.
i would guess that the ideal price is also to raise the charge to make finetuning a last resort rather than a first resort; its probably much better cost- and research-wise if everybody just prompts the same model than silo off in their own minimodels.
I don't think I'd consider it a 'last resort', since a lot of people will be choosing between finetuned GPT-3.5 and non-finetuned GPT-4, in which case finetuning is the cheap option.