| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by arthurcolle 25 days ago
	I found that keeping current context utilization at 18% of total context length was best for minimizing spend, across all models with 400k context length or more