Hacker News
by AndrewKemendo 28 days ago
Training RL policies on edge cases by using humans to collect and instrument previously closed data systems.