No Need for Speed: Why Batch LLM Inference Is Often the Smarter Choice

Y	Hacker News new \| ask \| show \| jobs

	No Need for Speed: Why Batch LLM Inference Is Often the Smarter Choice (sutro.sh)
	4 points by cmogni1 367 days ago