Hacker News new | ask | show | jobs
by nullpoint420 5 hours ago
Okay, I'll bite. What if your workload genuinely doesn't fit on one machine? Like load balancing or clustering 20+ nodes for LLM inference?