| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by momojo 53 days ago
	How long has this been your daily driver? How has this setup worked for you compared to enterprise models? Which models?

2 comments

lrvick 53 days ago

Maybe 2 months. I have mostly used the Qwen series, and currently running Qwen3.6 27B for programming and debugging and Qwen3.6 35B for speed and research. Both punch way way above their weight and replaced Qwen3.5 122B for me. Qwen 3.6 27B even is, for my workloads, preferable over Big Pickle (GLM-4.6) which is the only large third party model I have used extensively for reference and comparison as it is free and requires no signup or PII via OpenCode. My go to agent solution though is Charm Crush.

link

toomuchtodo 52 days ago

Do you have a write up available on your build? A friend is looking for a similar solution where they can offer an API/service for internal use.

link

lrvick 51 days ago

Not really much to write up.

Insert 2-4 $1200 r9700 GPUs in a Linux 7.0.0+ machine with 64GB+ of DDR4-5 memory, fire up llama.cpp, and connect with any OpenAI compatible tools.

A free public anonymous LLM like BigPickle can easily set up the software for you if in doubt.

link

throw1234567891 53 days ago

You are not shipping all your intellectual property to a third party. There’s nothing more valuable than that.

link

Frannky 52 days ago

Yeah, the point was mostly that you can offload a lot of stuff to AI + code — stuff that before you would have needed people for.

Obviously, it becomes better to have local models running on your own hardware — that will be best. I don't think we are there yet, though. Software, yes. If you tweak Pi and DeepSeek Pro, you can get Claude-code-level stuff. You'd still need to buy the hardware, though. Not cheap. Eventually, it will get very cheap.

link