You really need some kind of free-tier, but I understand you're putting yourself at risk with that. Ideally you could run the model in the browser with WebGPU for trial users so that they could bear the cost.
I'd be curious about how big the model is. The loading time could be quite long. I suppose with caching, it won't be a big deal after the first run, though.