| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by storus 94 days ago
	They significantly lowered latency compared to EPYC/Xeon, which is critical for streaming agents (e.g. text/audio/video agents).

1 comments

What latency? How much is it compared to LLM inference speed?

See the Redpanda comment/link here.