| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by brabel 306 days ago
	For quality to be comparable, you need to use a relatively big model, which will only work if you have around 64GB of RAM or more. The latest OpenAI local models (https://openai.com/index/introducing-gpt-oss/), for example, are really good, but you probably want the 120b to have results at least near what you get with their best cloud models, and that requires I think 80GB+. If you don't have that much, you can try stuff like the DeepSeek models, which are known for being ultra-efficient and runnable with "normal" computers, if you don't mind the politics of using that (and there are many models now that are similar!) but I haven't tried too many more to be able to comment. On my Macbook M1 Pro I can run the gpt-oss-20b model without issues and quite fast.

3 comments

KronisLV 306 days ago

I had pretty mixed experiences with the 20B version of GPT-OSS, sometimes that thing would just start looping in the thinking block and no sampler parameters would seem to do anything for specific questions.

That said Qwen3 and Qwen3 Coder are both pretty nice. Also ERNIE 4.5 if the benchmarks are to be trusted but I mostly run Ollama instead of vLLM now so can’t test it out atm (apparently llama.cpp added support for them recently though).

The models by Mistral might also be worth a look and personally I thought the EuroLLM project was also nice, but MoE models feel way more palatable on limited hardware.

Neither seem to be able to directly compete with Sonnet 4 or Gemini 2.5 Pro, would need way better hardware to come close.

link

nine_k 306 days ago

Hmm, well. So I need a 64GB MBP to run the AI tools, and another machine (likely running Linux) to run the system under development, since we're going all local. Well, doable.

link

bossyTeacher 306 days ago

>if you don't mind the politics of using that

what exactly are the "politics" of using DeepSeek? Feels weird to single out DeepSeek like that

link

kaashif 306 days ago

Using anything Chinese is political while using anything American or European is obviously totally apolitical?

Of course!

link

jakelazaroff 306 days ago

Not sure why parent is being downvoted here. Even without getting into whether it's possible for technology to be apolitical, many AI companies have explicitly political goals.

For example, OpenAI's charter is "to ensure that artificial general intelligence benefits all of humanity". They go on to list more specific political goals downstream from that: https://openai.com/charter/

link

layla5alive 305 days ago

And you believe them?

link

jakelazaroff 305 days ago

That's neither here nor there. The point is that it's expressly political, so using ChatGPT is every bit as political as using DeepSeek.

link