|
|
|
|
|
by radarsat1
2906 days ago
|
|
I think that analogy is a bit bogus, but if you want to make it, it's more like assuming access to a function that renders the 3D model from a variety of perspectives on command, not having access to the model itself. (Because the RL algorithm doesn't have access to the rules by which the simulation is carried out, it only has access to the commands and the result.) And frankly, that would be a perfectly fair and interesting classification problem, so I don't see your point. Otherwise, how exactly do you propose learning to drive a simulation without access to the simulation? I really don't know what you're saying here. |
|
Thanks for your analogy though. I agree that it's better than mine. I was only trying to give a rough idea, but I'll use your analogy if I have to now. :)