Hacker News
by deneas 497 days ago
I'd imagine using optimized/faster reward functions could already make a difference.