| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by PeterisP 1170 days ago
	There is a soft limit due to the computation required; the currently used model architectures are quadratic with respect to context size, so if you want ten times larger context size, that's going to need a hundred times more effort.