Hacker News
by demirbey05 535 days ago
Yes, it's reinforcement learning, but you need to create a policy, and each policy is specialized for a specific task.