| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by preston25 93 days ago
	How do you think Primer will evolve as agents get better at long-horizon tasks? The milestone and verification loop make sense now as agents fail unpredictably on complex tasks. Do tasks just get bigger and milestones scale with them?