Hacker News new | ask | show | jobs
Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
7 points by limondas 19 hours ago
Claude Code like agentic workflow ai too costly for me.Any LLM can I run with VSCode at the below setup? 16ram Intel core i7 h processor 13gen 512gb NVMe SSD I want to run the ai as local agentic workflow with Vscode.I want use LLAMA agent to build it. I have tried Gemma3 270M, Qwen3.5 4bit ,they are fast but less coding focused. I will be grateful for your response. Thank you.
2 comments

how about opencode's go ( https://opencode.ai/go ) it is very cost effective and the models are surprisingly competent....
Try qwen3-coder or qwen3-coder-next models which fit into your configuration. This is team-of-experts model which may load only actual experts into GPU.
Thanks for your reply. But it's to big for my PC. In PC around 1.5GB models got 20 token/s , which is too low for agentic workflow.
try latest gemma4:12b. It fits into 16Gb with 256K context window