Hacker News
by
jeremyjh
58 days ago
This has nothing to do with the cost of storage. Surprisingly, you are not better informed than Anthropic on the subject of serving AI inference models.
A sibling comment explains:
https://news.ycombinator.com/item?id=47886200
uoaei
55 days ago
They don't cache model state to disk. I am proposing they do.
jeremyjh
55 days ago
I’m proposing that you should educate yourself on the subject of LLM KV context caching.