Hacker News
by
dloss
836 days ago
Which of the model requirements of Coral TPUs [1] are the most problematic for LLMs?
[1]
https://coral.ai/docs/edgetpu/models-intro/#model-requiremen...
2 comments
semisight
836 days ago
Guessing at what the GP meant: Coral TPUs max out at around 8M parameters, IIRC. That's a few orders of magnitude fewer than the smallest LLM.
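To put a rough number on "a few orders of magnitude", here is a minimal sketch. The 8M budget is the figure recalled in the comment above; the 1B and 7B model sizes are illustrative assumptions, not numbers from this thread.

```python
import math

# Figure quoted in the comment above (recalled from memory there, so approximate).
CORAL_PARAM_BUDGET = 8_000_000

# Hypothetical LLM sizes chosen for illustration; not taken from the thread.
EXAMPLE_LLM_PARAMS = {"1B model": 1_000_000_000, "7B model": 7_000_000_000}


def orders_of_magnitude(larger: int, smaller: int) -> float:
    """Return log10 of the ratio between two parameter counts."""
    return math.log10(larger / smaller)


for name, params in EXAMPLE_LLM_PARAMS.items():
    ratio = params / CORAL_PARAM_BUDGET
    gap = orders_of_magnitude(params, CORAL_PARAM_BUDGET)
    print(f"{name}: {ratio:.0f}x the budget, about {gap:.1f} orders of magnitude")
```

Under those assumptions the gap comes out at roughly two to three orders of magnitude.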
moffkalast
836 days ago
The part where they have like 3 bytes of memory, so you switch from the extremely high latency of RAM to the laughably sluggish latency of USB serial. I think there's also no support for quants below 8 bits, which you'd really need.
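For a sense of why the quantization width matters, a back-of-the-envelope sketch of weight storage at different bit widths. The 1B parameter count is a hypothetical model size picked for illustration, and the calculation covers weights only (no activations or runtime overhead).

```python
def weight_bytes(num_params: int, bits_per_weight: int) -> int:
    """Bytes needed to store the weights alone, rounded up to whole bytes."""
    return (num_params * bits_per_weight + 7) // 8


# Illustrative model size; an assumption, not a figure from this thread.
HYPOTHETICAL_PARAMS = 1_000_000_000

for bits in (16, 8, 4):
    mib = weight_bytes(HYPOTHETICAL_PARAMS, bits) / 2**20
    print(f"{bits}-bit weights: {mib:.0f} MiB")
```

Halving the bit width halves the weight footprint, which is why sub-8-bit quants matter so much when on-device memory is tiny and everything else has to stream over the bus.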