Y
Hacker News
new
|
ask
|
show
|
jobs
by
6510
319 days ago
Noob question: Would it be possible to train a small model for a single prompt?
1 comments
GabrielBianconi
319 days ago
With supervised fine-tuning (SFT), you'll often see good results with 100-1000+ datapoints (they can be variations of the same prompt template). If you have more limited data, reinforcement fine-tuning (RFT) can work well in the 10-100 range.
Good luck!
link
Good luck!