by antirez 650 days ago
It's very surprising that the author of this post does 99% of the work and writing, and then doesn't go the last 1%: downloading ollama (or some other llama.cpp-based engine) and testing how a decent local LLM performs in this use case. Maybe a 7B or 30B model would do great here, and that's cheap enough to run: no GPT-4o needed.
1 comment

Not OP, but thanks for the suggestion. I’m starting to play around with LLMs and will explore locally hosted versions.