|
|
|
|
|
by jokethrowaway
268 days ago
|
|
the subject of this news likely doesn't generalise to random documents You need some serious resources to do this properly, think about granite docling model by IBM. For LLM:
Finetuning makes sense for light style adjustments with large models (eg. customize a chat assistant to sound a certain way) or to teach some simple transformations (eg. a new output format). You get away with 100-1000 samples. If you want to teach new behaviour you need a lot of data, likely too much to justify the investment for your average chatgpt wrapper AI company.
The pragmatic choice is often to just prompt engineer and maybe split your task and combine multiple prompts |
|