Hacker News
by
angry_octet
42 days ago
It seems like tool calling shouldn't add 500ms of latency?
1 comment
hobofan
42 days ago
If your tool calling is complex enough to need a higher reasoning level, and you would otherwise have reasoning set to "none", the extra reasoning alone can easily come out to 500ms.