|
|
|
|
|
by anthonybsd
56 days ago
|
|
I'm talking about per-request model remember? With extensive prompt you realistically can have one request every 10 minutes because the agent will be busy for at least 10 minutes executing it. They aren't rate limiting that. |
|
The harness receives a response, has to parse out the tool call, execute it and then start a new request with the tool call result.