Rate limiter for LLMs outperforms exponential backoff (github.com)
2 points by sftombu 634 days ago
1 comment

rateLLMiter is a Python rate limiter that smooths out requests to LLM APIs to get faster, more consistent performance. It uses a ticket bucket algorithm rather than the usual exponential backoff.
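
For readers who haven't seen this style of limiter, here is a minimal sketch of the classic token bucket, which I'm assuming the "ticket bucket" described above resembles. This is not rateLLMiter's actual code or API: the class name `TokenBucket`, its methods, and its parameters are all hypothetical, chosen only to illustrate the idea of handing out permits at a steady rate instead of retrying after failures.

```python
import time


class TokenBucket:
    """Illustrative token bucket (hypothetical, not rateLLMiter's API).

    Holds up to `capacity` tokens and refills at `rate` tokens per second.
    """

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate              # tokens added per second
        self.capacity = capacity      # maximum burst size
        self.clock = clock            # injectable clock, handy for testing
        self.tokens = float(capacity) # start with a full bucket
        self.last = clock()

    def _refill(self):
        # Add tokens for the time elapsed since the last check, capped at capacity.
        now = self.clock()
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now

    def try_acquire(self, n=1):
        # Take n tokens if available; otherwise leave the bucket unchanged.
        self._refill()
        if self.tokens >= n:
            self.tokens -= n
            return True
        return False

    def wait_time(self, n=1):
        # Seconds until n tokens will be available (0.0 if they already are).
        self._refill()
        return max(0.0, (n - self.tokens) / self.rate)
```

A caller would check `try_acquire()` before each request and sleep for `wait_time()` when it returns False. The contrast with exponential backoff is that requests are paced up front, so the client waits only as long as the refill rate requires rather than doubling its delay after each rejected call.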