Hacker News new | ask | show | jobs
by politelemon 81 days ago
Depends on what you're using it for, a small model could be viable as long as you're willing to absorb the maintenence overheads of running and deploying your own inference. A simple API would be much more cost effective especially if there are scaling requirements and time constraints.
1 comments

Use for support chat bot. I see a lot of open source models. Not sure if it's worth it. Reply via API from LLM should be better?