| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by stevenpetryk 249 days ago
	This is referred to as “online reinforcement learning” and is already something done by, for example Cursor for their tab prediction model. https://cursor.com/blog/tab-rl

1 comments

Not sure that’s the same. They just very frequently retrain and “deploy a new model”.