Hacker News new | ask | show | jobs
by chenning 2431 days ago
Don't you need data aggregation to build the model though? I think that's part of the problem, but I could be wrong.
1 comments

No. You could do this with any model that has feedback built in. I don't think you'd even need something as complicated as Q-learning but any type of reinforcement "learning" method would work. You're basically teaching a PID that has a clock.