|
|
|
|
|
by nomel
286 days ago
|
|
> But having them spitting tokens 24/7 for you would have you paying off a whole enterprise-scale GPU in a few months, too. Again, what's the use case? What would make sense to run, at high rates, where output quality isn't much of a concern? I'm genuinely interested in this question, because answering it always seems to be avoided. |
|