by ReptileMan 3 hours ago
You could probably get by with a couple of instances. People rarely use AI 24/7, so right now you can oversubscribe and still get acceptable latency and a high utilization rate.
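
To put rough numbers on that, here's a back-of-the-envelope sketch. Every figure in it is made up for illustration (400 users, 12.5% of them active at once, 10 concurrent requests per instance), and the function names are my own, so plug in your own measurements:

```python
import math

def instances_needed(total_users, active_fraction, concurrent_per_instance):
    """Instances required to serve the expected number of simultaneously active users."""
    expected_active = total_users * active_fraction
    return max(1, math.ceil(expected_active / concurrent_per_instance))

def oversubscription_ratio(total_users, instances, concurrent_per_instance):
    """Subscribed users per slot of concurrent capacity."""
    return total_users / (instances * concurrent_per_instance)

# Hypothetical inputs, not taken from any real deployment.
total_users = 400
active_fraction = 0.125        # assumed share of users active at the same time
concurrent_per_instance = 10   # assumed requests one instance handles at acceptable latency

n = instances_needed(total_users, active_fraction, concurrent_per_instance)
ratio = oversubscription_ratio(total_users, n, concurrent_per_instance)
print(f"{n} instances, {ratio:.1f} users per concurrent slot")
```

This sizes for the average case only. It leaves no headroom for bursts, so latency will degrade whenever more than the assumed fraction of users show up at once.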