On the surface this seems incredibly stupid. But after thinking on it for a minute, maybe it makes sense for use cases with very few tokens in and very few tokens out. Still feels awful, but maybe. Probably not. But maybe.
I'm wondering if there's a prototyping use case in there somewhere. Like... throw in a bunch of LLM calls that return vaguely sane data, in order to get the thing running, then replace them with something reliable before you get to production. Would that speed up building a demo enough to be worth doing?
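To make the prototyping idea concrete, here's a rough sketch of what I mean. Everything in it is made up for illustration: `PriceEstimator`, `LLMPriceEstimator`, `TablePriceEstimator`, `order_total`, and `fake_llm` are hypothetical names, and `complete` is just any function that takes a prompt string and returns text, not a real client library. The point is only that the rest of the code talks to an interface, so the LLM-backed version can be swapped out later.

```python
import json
from typing import Callable, Protocol


class PriceEstimator(Protocol):
    """Hypothetical interface the rest of the demo depends on."""

    def estimate(self, item: str) -> float: ...


class LLMPriceEstimator:
    """Prototype stand-in: asks a model and falls back if the reply is junk."""

    def __init__(self, complete: Callable[[str], str], fallback: float = 0.0):
        # `complete` is an assumed prompt -> text callable, injected by the caller.
        self.complete = complete
        self.fallback = fallback

    def estimate(self, item: str) -> float:
        prompt = f'Reply with JSON like {{"price": 1.23}} for: {item}'
        try:
            return float(json.loads(self.complete(prompt))["price"])
        except (ValueError, KeyError, TypeError):
            # Model output is only "vaguely sane", so never let it crash the demo.
            return self.fallback


class TablePriceEstimator:
    """The reliable replacement you swap in before production."""

    def __init__(self, prices: dict[str, float]):
        self.prices = prices

    def estimate(self, item: str) -> float:
        return self.prices[item]


def order_total(estimator: PriceEstimator, items: list[str]) -> float:
    # Calling code only knows the interface, not which implementation it got.
    return sum(estimator.estimate(item) for item in items)


def fake_llm(prompt: str) -> str:
    # Canned reply standing in for a real model call.
    return '{"price": 2.5}'
```

For the demo you'd call `order_total(LLMPriceEstimator(fake_llm), items)` with a real model call in place of `fake_llm`, and later switch to `order_total(TablePriceEstimator(prices), items)` without touching anything else. Whether that actually saves time over just hardcoding fake data is the open question.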