Hacker News
by sabareesh 279 days ago
This is great, but most of the work lies in curating the dataset and designing the objective functions for RL.