Y
Hacker News
new
|
ask
|
show
|
jobs
by
eru
181 days ago
And our LLMs still have latencies well into the human perceptible range. If there's any necessary, architectural difference in latency between TPU and GPU, I'm fairly sure it would be far below that.