Hacker News new | ask | show | jobs
by hsuduebc2 37 days ago
So it's just a bigger model? Like for example todays 1T models?
1 comments

Supposedly 10T scale. Literally the next big thing. A bit like what OpenAI tried with GPT-4.5 - but Anthropic actually made it work with MoE, reasoning, tool use, RLVR, etc.

It matters because the "g factor" of today's LLMs is at least in part a function of raw scale. Larger models are just smarter - assuming you can handle the training and inference at this increased scale.

So, realistically, how much further can this go? How many more orders of magnitude?
100T definitely. 1Q maybe. Beyond that, we need new architectures or new types of inference hardware - maybe both.

By some estimates, 100T is about what the human brain has when used to its fullest.