Hacker News new | ask | show | jobs
by wruza 817 days ago
So the networks you mentioned aren’t LLMs? Why is that a correct comparison then. Like blaming a human that they can’t jump like a cat or multiply like an arbitrary-precision library.
1 comments

> So the networks you mentioned aren’t LLMs? Why is that a correct comparison then

Because an LLM is a neural network and neural networks contains neural networks. There is nothing stopping it from having an embedded neural network that learned how to do computations well, except an inability to identify such structures and patterns well enough to train for it.

Tokenizing ‘1735’ as a value of 1735 because you’ve seen a lot of math is probably the most difficult part.