| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by marvin 3651 days ago

I think you've hit the nail on the head here, this feels to me like one of a few responses in this thread that gets to the core of the issue if we're thinking about long-term AI risk.

In fact, I'm surprised that there doesn't seem to be any reference in the article to previous work on these philosophical implications, e.g. the stuff that has been written by Nick Bostrom or MIRI. Perhaps there are some in the paper?

I think that for the forseeable future, we will inevitably end up with two of the problems that various philosophers have outlined over the last few years:

(1) How do we ensure that an AI agent does exactly what we want it to do and

(2) What do we ultimately want if we can desire anything?

I think that any developer trying to approach this will be doomed to hack around these two issues. We can probably come a long way in AI capabilities without having the optimal solution to this, but the core problem will remain for a long time and haunt those who are cautious.

1 comments

argonaut 3651 days ago

1) Is exactly what this paper is addressing. The fact that their is philosophical ambiguity is precisely why these are problems, and not solved already.

link