| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by bavell 62 days ago
	> I would also expect to see it taking exponentially longer to process a prompt. I don't believe LLMs work like that. Try this out using a local LLM. You'll see that as the conversation grows, your prompts take longer to execute. It's not exponential but it's significant. This is in fact how all autoregressive LLMs work.