| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by wolpoli 870 days ago
	Early on, I noticed that if I ask ChatGPT an unique question that might not have been asked before, it'll split out a response slowly, but repeating the same question would result in a much quicker response. Is it possible that you have a caching system too so that you are able to respond instantly with paragraphs of technical information to some types of requests that you have seen before?

1 comments

Yes, search for LLM caching and semantic searches. They must be using something like that.