| HN Mirror


by potsandpans 70 days ago

I agree. I run gemma4 31b as an int4 quant on my 5090 and find that it's quite capable for self-contained tasks. There are larger open-weight models that are more capable, such as minimax and glm5.1.
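For a rough sense of why a 31b model at int4 fits on a single 5090, here's a back-of-the-envelope sketch. It only counts the weights; the 4 GiB allowance for KV cache and activations is a made-up placeholder, the 32 GiB figure is the card's nominal VRAM, and real int4 formats carry scale metadata that pushes the effective bits per weight somewhat above 4. The function names are just for illustration.

```python
def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


def fits(n_params: float, bits_per_weight: float, vram_gib: float,
         overhead_gib: float = 4.0) -> bool:
    """True if weights plus a flat overhead allowance fit in VRAM.

    overhead_gib is an assumed allowance for KV cache and activations,
    not a measured number; it grows with context length in practice.
    """
    return weight_memory_gib(n_params, bits_per_weight) + overhead_gib <= vram_gib


if __name__ == "__main__":
    params = 31e9   # parameter count taken from the model name
    vram = 32       # nominal VRAM of a 5090, in GiB (assumed)

    print(f"int4: {weight_memory_gib(params, 4):.1f} GiB")   # prints "int4: 14.4 GiB"
    print(f"fp16: {weight_memory_gib(params, 16):.1f} GiB")  # prints "fp16: 57.7 GiB"
    print("int4 fits:", fits(params, 4, vram))
    print("fp16 fits:", fits(params, 16, vram))
```

By this estimate the int4 weights take a bit under half the card, which leaves room for context, while the same model at 16-bit wouldn't fit on one card at all.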

I've toyed with the idea of buying two rtx 6000s and NVLinking them. But the cost-benefit doesn't pan out yet: it's still cheaper to use OpenRouter or some subscription plan for open-weight models.

I'm looking forward to continued optimization from the open-weight labs and their models. Qwen and gemma4 are quite capable.

Also, I feel what's really underutilized is a suite of LLM/AI tools that are completely open and runnable locally:

Hunyuan 3d 2.0, trellis2, unirig

Flux 2 dev, z image, qwen image edit

Ltx 2.3 / wan

Ace step 1.5

All great for creation pipelines. Couple those with other smaller things like sam2 and dino. It's very exciting to see these things producing high-quality output on local systems.