Hacker News new | ask | show | jobs
by luke_cq 1169 days ago
I desperately want to be able to get a concrete amount of tokens for my prompt before making a call - things like this make it very hard to request the right amount of max_tokens from longer prompt/generation pairs.
1 comments