|
You do realize how possible it is to fine tune a task like this (along with a hundred others in a similar vein) on a tiny model you can scale on your own hardware? I've run hundreds of millions (150m so far in a couple of weeks of non-continuous running as I tweaked things) of tokens through my 2x 3090 with a 13b llama2 model I fine tuned on tasks like: summary, knowledge graph generation, writing using the knowledge graph, grammar, spelling, and transcription correction, etc. This type of stuff is going to be done at scale with a modest budget if you have the skills to tune more efficient and faster models to your use cases. |
[1]: https://imgur.com/a/S9jnHWJ