Hacker News new | ask | show | jobs
by botirk 253 days ago
We built our infra on Azure during a hackathon. It made sense at the time, so we stuck with it.

For a while, Container Apps worked fine. Then we launched our AI model router demo, and everything changed.

In just two days, we spent over $250 on GPU compute. Two uni students, a side project, and suddenly we were paying production-level bills.

Autoscaling was slow. Cold starts were bad. Costs were unpredictable.

Then I watched a talk from one of Modal’s founders about GPU infra. We gave Modal a try.

Now we’re running the same workloads for under $100, with fast autoscaling and no lag.

Azure was stable, but Modal gave us speed, control, and real cost efficiency.

Anyone else switch from Azure (or AWS/GCP) to Modal for AI workloads? What was your experience?