|
|
|
|
|
by crishoj
264 days ago
|
|
Any idea what "output token efficiency" refers to?
Gemini Flash is billed by number of input/output tokens, which I assume is fixed for the same output, so I'm struggling to understand how it could result in lower cost. Unless of course they have changed tokenization in the new version? |
|
Which is a good thing in my book as the models now are way too verbose (and I suspect one of the reasons is the billing by tokens).