|
|
|
|
|
by faxmeyourcode
110 days ago
|
|
Labeling or categorization tasks like this are the bread and butter of small fine tuned models. Especially if you need outputs in a specific json format or whatever. I did an experiment where I did very simple SFT on Mistral 7b and it was extremely good at converting receipt images into structured json outputs and I only used 1,000 examples. The difficulty is trying to get a diverse enough set of examples, evaling, etc. If you have great data with simple input output pairs, you should really give it a shot. |
|