Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

Y	Hacker News new \| ask \| show \| jobs

	Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
	7 points by limondas 19 hours ago
	Claude Code like agentic workflow ai too costly for me.Any LLM can I run with VSCode at the below setup? 16ram Intel core i7 h processor 13gen 512gb NVMe SSD I want to run the ai as local agentic workflow with Vscode.I want use LLAMA agent to build it. I have tried Gemma3 270M, Qwen3.5 4bit ,they are fast but less coding focused. I will be grateful for your response. Thank you.

2 comments

slipwalker 5 hours ago

how about opencode's go ( https://opencode.ai/go ) it is very cost effective and the models are surprisingly competent....

link

denn-gubsky 18 hours ago

Try qwen3-coder or qwen3-coder-next models which fit into your configuration. This is team-of-experts model which may load only actual experts into GPU.

link

limondas 16 hours ago

Thanks for your reply. But it's to big for my PC. In PC around 1.5GB models got 20 token/s , which is too low for agentic workflow.

link

denn-gubsky 8 hours ago

try latest gemma4:12b. It fits into 16Gb with 256K context window

link