Hacker News
by christina97 127 days ago
Author is clearly confused about the Anthropic case. The request rate at these generation endpoints is so high that the current batching delay is effectively negligible.
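
For a rough sense of scale, here is a back-of-the-envelope sketch of how long a request waits for a batch to fill. It assumes requests arrive at a steady average rate and that a batch is dispatched as soon as it is full, with no timeout. The function name, rates, and batch size are made up for illustration; they are not figures from Anthropic or from the article.

```python
def batch_fill_wait_s(rate_per_s: float, batch_size: int) -> tuple[float, float]:
    """Expected wait, in seconds, for a batch to fill.

    Assumption: arrivals have a mean spacing of 1 / rate_per_s and the
    batch is sent the moment it holds batch_size requests.
    Returns (wait of the first request in the batch, mean wait over the batch).
    """
    if rate_per_s <= 0 or batch_size < 1:
        raise ValueError("rate_per_s must be > 0 and batch_size >= 1")
    # The first request must wait for batch_size - 1 further arrivals.
    first = (batch_size - 1) / rate_per_s
    # Request i waits for batch_size - i arrivals; averaging over i halves it.
    mean = (batch_size - 1) / (2 * rate_per_s)
    return first, mean


if __name__ == "__main__":
    # Hypothetical request rates, chosen only to show the trend.
    for rate in (10, 1_000, 100_000):
        first, mean = batch_fill_wait_s(rate, batch_size=32)
        print(f"{rate:>7} req/s: first waits {first * 1e3:.3f} ms, mean {mean * 1e3:.3f} ms")
```

Under these assumptions the wait scales as 1 / rate, so a 100x higher request rate means a 100x shorter fill delay for the same batch size.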