Y
Hacker News
new
|
ask
|
show
|
jobs
by
throwdbaaway
93 days ago
90% of what you pay in agentic coding is for cached reads, which are free with local inference serving one user. This is well known in r/LocalLLaMA for ages, and an article about this also hit HN front page few weeks ago.