How much are people willing to pay? A quick search shows Google's STT API at $1.44/hour. As an example, the Joe Rogan Experience is ~1500 multi-hour episodes, meaning it would cost >$5000 for just that one show.
Presumably the OP is using an offline speech processing tool, but compute costs would still be expensive.
Presumably the OP is using an offline speech processing tool, but compute costs would still be expensive.