| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by DeborahEmeni_ 478 days ago
	Really cool setup! Curious how much of the performance here could vary depending on whether the model runs in a hosted environment vs local. Would love to see benchmarks that also track how cloud-based eval platforms (with potential rate limits, context resets, or system messages) might affect things like memory or secret-keeping over multiple rounds.