| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by chilipepperhott 12 days ago
	I find any and all claims like this ridiculous from a company who can't build a terminal application that uses less than a gigabyte of RAM.

10 comments

dang 12 days ago

"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

https://news.ycombinator.com/newsguidelines.html

link

thesmtsolver2 12 days ago

For some reason, idling Claude Code needs 100% of my CPU.

link

nicce 12 days ago

Like Google’s AI studio tab in browser. Incredible degrade in software quality?

link

asdfman123 12 days ago

Developers can develop leaner applications, but they're usually not incentivized to.

Frankly, I love efficiency too, but I've hard to learn the hard way that what the market wants is features. Or at the very least, the executive team wants that.

link

j2kun 12 days ago

Their whole argument is that AI's added efficiency means they don't need to set aside valuable human time anymore. Why can't they just point Claude at Claude Code and ask it to reduce memory usage by 90%?

link

asdfman123 12 days ago

You can do that. But I'm telling you, in tech (and enterprise shops I've worked at too) they don't care.

I'm using the internal Google tools and it's helping me write code much faster too, but it still takes time. I could make the CLI tool I work on faster, but no one cares except the end users, and their minor concerns have no impact on our internal politics.

At the end of the day you have to do what you're paid to do, unfortunately.

link

fg137 12 days ago

In other words, performance is almost always an afterthought.

link

Garlef 12 days ago

Make it work, make it nice, make it fast.

link

jachee 12 days ago

Good, Fast, Cheap.

Pick any two.

link

asdfman123 12 days ago

Sure

link

toephu2 12 days ago

I have iterm2 open right now with Claude in a long session and it's only using 500MB of memory.

link

pizlonator 12 days ago

Only 500MB!

you are confirming their point even as you contradict the specifics

link

z3c0 12 days ago

And highlighting a disconnect in the developer community. Some of us are okay with unnecessary overhead for quick results. I always felt gross dealing with Electron apps, but they're popular for a reason.

link

deathanatos 12 days ago

But each day now that overhead becomes more costly as AI drives up the very cost per byte of RAM.

link

verdverm 12 days ago

they make one of those electron apps too

link

toephu2 11 days ago

Followup: I closed out Claude completely, iterm2 completely. Reopened iterm2, and it appears iterm2 is using about 500MB of memory. So this has nothing to do with Claude Code CLI.

link

ChrisLTD 12 days ago

Yeah. Bonkers considering the brain of the application isn’t even on your device.

link

andriy_koval 12 days ago

Maybe that gigabyte is occupied by useful information: traces/memory?

link

flexagoon 12 days ago

Traces and memory are text. A gigabyte of text is an insane amount. That is an equivalent of tens of millions of lines of code, or hundreds of millions of AI tokens.

link

overgard 12 days ago

A gigabyte is a lot of memory. Even the largest context windows are a small fraction of that with any sane engineering discipline.

link

andriy_koval 12 days ago

For each LLM interaction they likely have bunch of thoughts traces, tool calls, etc, which don't go to context, but still can be retrieved.

But I obviously don't know for sure.

link

javcasas 12 days ago

Nope. Used to render on the terminal like a game engine.

https://x.com/trq212/status/2014051501786931427

link

oersted 12 days ago

This kind of immediate-mode rendering is quite standard for TUIs. Although immediate-mode rendering tends to be significantly simpler and use less memory than retained-mode rendering, at the cost of some redundant computation. So I am not sure if this is the reason for the bloat.

It’s possible that it doesn’t play well with JS garbage collection, since it recreates the whole UI structure for every frame (which tends to not to be an issue in the languages immediate-mode is usually employed).

But yes it’s a bit more akin to game renderings than web rendering. Which can be totally fine if done well.

link

overgard 12 days ago

I haven't tried to make a TUI admittedly, but double buffering is the oldest technique on the planet. A TUI doesn't even need to pay the cost of a lot of pixels since its effective resolution is much lower

link

javcasas 11 days ago

Long long time ago, I used to do some graphics stuff in 320x240, which uses a whopping 64KB per buffer, and still has more resolution than a terminal.

In 1GB I could probably fit all the buffers to double-buffer all the TUIs in a whole country. Well, maybe not. But it's likely not that far off.

link

est 12 days ago

> render on the terminal like a game engine

All those CPU to render this effect

https://x.com/cyrilXBT/status/2060617507615207904

link

javcasas 12 days ago

ULTRACODE! NOW WITH MORE BLINKENLIGHTS!

(to be read with the Unreal Tournament announcer voices, see https://www.youtube.com/watch?v=MwxjYFqP35A )

link

ux266478 12 days ago

How on earth are you spending more than 50us on a UI like this from start to finish? What the actual hell? 11ms to construct a scenegraph of this complexity? I don't even know what to say to that.

link

nicce 12 days ago

In comparison, I have around 3ms total latency when streaming 4k 144hz from headless machine in my basement to upstars :-D

At least that is what Moonlight client claims.

link

krapp 12 days ago

Frankly that's an insult to gamedev. Literally every game engine I can think of could do better. Probably even Unreal Engine could do better.

link

ux266478 12 days ago

If I saw our UI show up in the profiler eating 5ms of CPU time every frame, I'd send whoever was responsible to QA hell until they find some way to redeem themselves. Not even fancy animated 3D UIs, like what you get in Death Stranding, eat up these kinds of resources. Not even remotely close.

link

lstodd 12 days ago

I sorta remember Quake console running on an 486dx2 ..

link

airstrike 12 days ago

LOL right? This is all that needs to be said about the engineering behind Claude Code

link

fg137 12 days ago

Do game engines constantly have buffer issues?

link

overgard 12 days ago

Depends on if they're written with Claude

link

davidatbu 12 days ago

So would you take these claims seriously if they came from OpenAI (since Codex is a pretty lean CLI app)?

If so, I think it would be in the spirit of HN to discuss the subject matter of the blogpost (increasingly autonomous coding towards the end goal of RSI) as if the blog post was indeed from OpenAI. OpenAI is, by all accounts, going through a very similar process anyways.

link

Jtarii 12 days ago

Well, they could very easily if they wanted. There is just no economic value in it.

link

Lplololopo 12 days ago

Really? Let me explain how bigger companies work:

They have different teams for different departments with different type of people.

So the team or teams responsible for writing the terminal application are different people than the researchers doing the learning.

This can lead to dimentral quality aspects.

link

cpursley 12 days ago

A came here just to write: Pretty please let it churn for a few nights and redo Claude Code in Rust. Because the harness is very very good as are their models, but that node thing is a hog for no good reason at all.

link

ale 12 days ago

Incoming rust rewrite branch ready to merge: +1,009,257 -4,024

link

canadiantim 12 days ago

People already rebuilt Claude Code in Rust after the Claude Code leak, it's on github as claw code (and other variants)

link

bpodgursky 12 days ago

They obviously don't care, aren't making any attempt whatsoever to do this, and 99% of users don't care either.

If you want to pollute your own priors with weird artificial litmus tests, it's a free country, but the artificial world-model you build in your head does not affect the real world around you.

link