Hacker News
stevenpetryk, 249 days ago
This is referred to as “online reinforcement learning” and is something already done by, for example, Cursor for their tab prediction model.
https://cursor.com/blog/tab-rl
tinodb, 246 days ago
Not sure that’s the same. They just very frequently retrain and “deploy a new model”.