Hacker News new | ask | show | jobs
by jonahbenton 298 days ago
I'm using phi4 for writing, one of the qwen3s for coding and a small mistral for classification and other small tasks. Have a framework desktop showing up soon, will put a 70b/80b multimodal on it for image and pdf processing.

I have used ollama, lmstudio, jan and vllm at different times, am readying for a wholesale transition to llamacpp.

1 comments

Yo! You’re a serious guy, Jonah!

I run Qwen3 locally for coding and writing. It’s a solid model.

Framework Desktop is becoming really popular. The one with the Max+ 395 and 128GB of RAM is an absolute beast. I might buy a Beelink GTR9 Pro (Max+ 395 with 128GB RAM), which costs around $2,000.

llama.cpp is the real deal. I’m using it as the engine for the product I’m building right now (https://tygra.ai/).