| HN Mirror



	by happyPersonR 452 days ago
	Pretty sure llama.cpp can already do that

1 comments

TYMorningCoffee 452 days ago

I forgot to clarify: I meant dealing with the network bottleneck.


moralestapia 451 days ago

Just my two cents from experience: any sufficiently advanced LLM training or inference pipeline eventually figures out that the real bottleneck is the network!
