Hacker News new | ask | show | jobs
by za_mike157 106 days ago
A lot of AI workloads require GPUs which are expensive so customers would waste money running idle machines 24/7 with low utilisation which kills gross margins. By loading containers quickly means, means we can scale up quickly as requests come in and you only need to pay for usage.

This is successful for CPU workloads (AWS Lambda) but AI models and images are 50x the size

1 comments

As I said, if only you were providing more value rather than being a commodity, you could avoid all this.