Hacker News
No Need for Speed: Why Batch LLM Inference Is Often the Smarter Choice (sutro.sh)
4 points by cmogni1 367 days ago