Y
Hacker News
new
|
ask
|
show
|
jobs
by
adamnemecek
53 days ago
I think that you send the entire conversation with every request.
1 comments
darkteflon
53 days ago
As long as you stay under the 1-hour caching TTL for your open threads, I guess your marginal cost is linear.
This is me on a weekday flicking between Ghostty tabs to enter “stand by” every ~45 mins.
link
ricardobeat
53 days ago
Anthropic changed the cache TTL to five minutes, back in March.
link
darkteflon
53 days ago
Thanks, didn’t realise the API and Claude Code had different TTL.
link
This is me on a weekday flicking between Ghostty tabs to enter “stand by” every ~45 mins.