Hacker News new | ask | show | jobs
by angry_octet 42 days ago
It seems that tool calling shouldn't be 500ms of latency?
1 comments

If you have tool calling complex enough that it necessitates a higher reasoning level, and you would otherwise have reasoning set to "none", this can easily come out to 500ms.