|
|
|
|
|
by Animats
1846 days ago
|
|
Their paper "Concrete problems in AI safety"[1] is interesting. Could be more concrete.
They're run into the "common sense" problem, which I sometimes define, for robots, as "getting through the next 30 seconds without screwing up". They're trying to address it by playing with the weighting in goal functions for machine learning. They write "Yet intuitively it seems like it should often be possible to predict which actions are dangerous and explore in a way that avoids them, even when we don’t have that much information about the environment." For humans, yes. None of the tweaks on machine learning they suggest do that, though. If your constraints are in the objective function, the objective function needs to contain the model of "don't do that". Which means you've just moved the common sense problem to the objective function. Important problem to work on, even though nobody has made much progress on it in decades. [1] https://arxiv.org/pdf/1606.06565.pdf |
|
The success of deep learning might be something of a curse - it's go enough success that creating a safe system seems to automatically be modifying a neural net to be safe despite it not having the "engineered from the start" quality.