Hacker News
by weird-eye-issue 59 days ago
Again, it is not based on the number of tokens. If it were based solely on the number of tokens, cache misses would not affect usage so much. It's based on the actual cost, which includes things like caching costs.
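
To make that concrete, here's a rough sketch of the difference. The rate table is made up purely for illustration (it is not any provider's real pricing), and the `Request` class and its field names are hypothetical. The only point is that two requests with the same token count can come out at very different cost-weighted usage depending on whether the cache was hit.

```python
from dataclasses import dataclass

# Hypothetical relative rates in integer "cost units" per token.
# These numbers are assumptions for illustration, NOT real pricing.
RATES = {
    "uncached_input": 100,  # baseline input token
    "cache_write": 125,     # assumed premium for writing to the cache
    "cache_read": 10,       # assumed discount for reading from the cache
    "output": 500,          # assumed output rate
}


@dataclass
class Request:
    uncached_input: int = 0
    cache_write: int = 0
    cache_read: int = 0
    output: int = 0

    def total_tokens(self) -> int:
        """Raw token count, ignoring how each token was billed."""
        return (self.uncached_input + self.cache_write
                + self.cache_read + self.output)

    def cost_units(self) -> int:
        """Cost-weighted usage: each token category has its own rate."""
        return (self.uncached_input * RATES["uncached_input"]
                + self.cache_write * RATES["cache_write"]
                + self.cache_read * RATES["cache_read"]
                + self.output * RATES["output"])


# Same prompt and same output size; only the cache outcome differs.
cache_hit = Request(uncached_input=1_000, cache_read=50_000, output=500)
cache_miss = Request(uncached_input=1_000, cache_write=50_000, output=500)

print("tokens:", cache_hit.total_tokens(), cache_miss.total_tokens())
print("cost units:", cache_hit.cost_units(), cache_miss.cost_units())
```

Under these assumed rates both requests count the same number of tokens, but the cache miss costs several times more, which is the behavior a purely token-based limit could not explain.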