| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by caesil 712 days ago
	Why not exactly? The model weights cannot encode a sort of math engine? The hidden state cannot encode carryover values? Why do we assume these things can't happen at some level?

1 comments

jasfi 712 days ago

I agree. The reasoning is there, and becoming more capable every year (across the various models). It's easy to look for limitations, but what was once glaring problems are now much more subtle.

link