Hacker News new | ask | show | jobs
by apwell23 253 days ago
> fine tuning because the work was repetitive enough that FT provided benefits in terms of speed and accuracy,

can you share anymore info on this. i am curious about what the usecase was and how it improved speed (of inference?) and accuracy.

1 comments

Very typical e-commerce use cases processing scraped content: product categorization, review sentiment, etc. where the scope is very limited. We would process tens of thousands of these so faster inference with a cheaper model with FT was advantageous.

Disclaimer: this was in the 3.5 Turbo "era" so models like `nano` now might be cheap enough, good enough, fast enough to do this even without FT.