Hacker News new | ask | show | jobs
by _1 301 days ago
> and with the goal you'll all finetune it for your use case.

What use-cases are a good fit for finetuning this model? More specific instruction following, knowledge from proprietary data, response tone?

2 comments

Any text to text use case with 32k context, especially if you're starting from the PT version you can finetune it to do whatever you need
I'm going to try training it on a codebook to see if such a small model would work for a TTS.