Hacker News new | ask | show | jobs
by reinitctxoffset 4 hours ago
Using a multi-trillion parameter softmax attention transformer to parse nested delimiters is a perverse thing to do. It is hard to imagine a sillier way to boil the oceans than feeding JSON to an LLM, a task that a pushdown automata from the 1960s effortlessly did on a PDP-X.

The API business throws a massive model that by definition can't be inferred efficiently because nothing can across 4 different compute substates, at a problem that DSv4 nails at or near 100% while leaving most of the actual unique value of Claude on the table.

Claude should be in your house and car and your kid's classroom and shit.

Having it write tail -n5?

That's because Anthropic's A-Team is Meta's C-Team. Hell, I fired some of their stars myself.

1 comments

You know what’s worse (less efficient) at parsing and writing code than LLMs?

Humans.