by ttctciyf 2949 days ago
> It is a well constrained problem

But attacking not-well-constrained problems is what's needed to show real progress in AI these days, right?

1 comment

I'd say getting better sample efficiency is a bigger deal. It isn't like POMDPs are a huge step away theoretically from MDPs. But if you attach one of these things to a robot, taking 10^7 samples to learn a policy is a deal breaker. So fine, please keep using games for research.
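
To put the 10^7 figure in perspective, here's a rough back-of-the-envelope sketch. The control rates are purely illustrative assumptions (not taken from any particular robot), and `days_to_collect` is a made-up helper name. It also assumes the robot runs nonstop with no resets, failures, or maintenance, so real numbers would likely be worse.

```python
# Back-of-the-envelope: wall-clock time to collect samples on a physical robot.
# The control rates below are illustrative assumptions, not measured values.

SECONDS_PER_DAY = 24 * 60 * 60


def days_to_collect(num_samples: int, samples_per_second: float) -> float:
    """Days of continuous, uninterrupted operation needed to gather num_samples."""
    return num_samples / samples_per_second / SECONDS_PER_DAY


if __name__ == "__main__":
    num_samples = 10**7  # the figure quoted in the comment
    for hz in (1, 10, 100):  # hypothetical control rates
        print(f"{hz:>3} Hz: {days_to_collect(num_samples, hz):.1f} days")
```

Under those assumptions, one sample per second works out to roughly 116 days of continuous running, and even 10 samples per second is still about 11.6 days, which a simulator or game can get through far faster than real time.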