by dabockster 310 days ago
I've been using a lot of the Chinese open-source models like R1 and the Qwen coder series. I've also been trying some community mods of them from Hugging Face.

I was using a combo of Roo Code for the front end and Ollama for the back end, but Ollama is kind of dumb when it comes to memory overload protections. I've since switched to LM Studio, but it has a very annoying hard timeout of 2-3 minutes on its API server. Local isn't perfect right now, but when it does run you can see the potential as clear as day.