|
|
|
|
|
by aurbano
477 days ago
|
|
Is AI the danger, or is our inability to simplify a problem down to an objective function the problem? If anything, AI could help by "understanding" the real objective, so we don't have to code these simplified goals that ML models end up gaming no? |
|
I feel that a good first step would be to introduce some kind of random jitter into the simulation. Like, in case of the wheels, introduce road bumps, and perhaps start each run by simulating dropping the wheel from a short distance. This should quickly weed out "too clever" solutions - as long as the jitter is random enough, so RL won't pick up on it and start to exploit its non-randomness.
Speaking of road bumps: there is no such thing in reality as a perfectly flat road; if the wheel simulator is just rolling wheels on mathematically perfect roads, that's a big deviation from reality - precisely the kind that allows for "hacky" solutions that are not possible in the real world.