Hacker News new | ask | show | jobs
by loufe 972 days ago
That's not true, there are some very pruned (and relatively dumb) LLMs which go under a gig.
2 comments

Small LLMs or large small language models???
What’s the point of a state of the art AI chip that can’t run large models? It seems problematic to say the least!
It is a new design, not a direct competitor for nVidia’s flagship. And there are lots more applications for AI than LLMs.
Realtime, low power, isn't really possible today. Think anything motion or reaction related.
Some of us still work on computer vision. 224MiB is fairly massive for convolutional neural network.
It can run small models very fast.