Y
Hacker News
new
|
ask
|
show
|
jobs
Rate limiter for LLMs outperforms exponential backoff
(
github.com
)
2 points
by
sftombu
634 days ago
1 comments
sftombu
634 days ago
rateLLMiter is a Python rate limiter that smoothes out requests to LLM APIs to get faster, more consistent performance. It uses a ticket bucket algorithm rather than the usual exponential backoff.
link