Hacker News new | ask | show | jobs
by AndrewKemendo 28 days ago
Training RL policies on edge cases by using humans to collect and instrument previously closed data systems.