Hacker News new | ask | show | jobs
by spywaregorilla 1814 days ago
Without knowledge of their reward function its difficult to tell if they're converged on this strategy or if its just broken.