| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by gignico 133 days ago
	Exactly. This flawed argument by which everything will be fixed by future models drives me crazy every time.

3 comments

bopbopbop7 133 days ago

Just a couple more trillion and 6 more months!

link

simianwords 133 days ago

That’s been the trend for a while. Can you make a prediction that says something concretely like “AI will not be able to do X by 2028” for a specific and well defined X?

link

xigoi 133 days ago

In 2030, an AI model that I can run on my computer, without having to trust an evil megacorporation, will not be able to write a compiler for my markup language [0] based on a corpus of examples, without seeing the original implementation, using no more than 1.5× as much code as I did.

https://git.sr.ht/~xigoi/hilda

link

direwolf20 133 days ago

Were any made about 2025?

link

Kiro 133 days ago

So far it has been accurate though. Models have gotten much better than even the most optimistic predictions.

link

LinXitoW 133 days ago

No? The most optimistic predictions involved AGI around the corner, 6 months until no more developers for years now.

link

rsynnott 133 days ago

... Eh? A few years back, the usual suspects were predicting AGI by, usually, either 2026 or 2027. Think that's gonna happen?

(Such predictions have been quietly forgotten or revised forward, in general.)

link

Kiro 133 days ago

No, I don't but it sounds very similar to the the naysayers that have silently moved the goalposts. That said, you're one of the few people in the wild that still claims LLMs are completely useless so I give you that.

link

rsynnott 133 days ago

You said:

> Models have gotten much better than even the most optimistic predictions.

We were promised Roko's Basilisk by now, damnit! Where's my magical robot god?!

But seriously, predictions a couple years back for 2026/27 (by quite big players, like Altman) were for AGI or as good as.

I do not, for the record, claim that they are totally useless. They are useful where correctness of results does not matter, for instance low-stakes natural language translation and spam generation. There's _some_ argument that they are somewhat useful in cases where their output can be reviewed by an expert (code generation etc), though honestly quantitive evidence there is mixed at best; for all the "10x developer" claims, there's not much in the way of what you'd call hard evidence.

link