| > I think it's a bit short-sighted to assume that the future development of AI models will only come from large corporations or research institutions. If this was a "forever" solution, I would agree. I believe the goal is more along the lines of trying to make some progress with the foundation of what "safety" even means in this concept. > Just because a government heavily regulates AI and possibly bans open weights doesn't mean that people on the fringes of legality will abide by it. Nor nation states. However, we do have international treaties — flawed though they are — on things from nuclear proliferation to CFCs. As a smaller-scale example: we can't totally prevent gun crime, yet the UK manages to be so much safer in this regard than the USA that even the police in the UK say that they do not want to be armed in the course of their duties. I don't know how realistic the concerns are for any given model; unfortunately, part of the problem is that nobody else really knows either — if we knew how to tell in advance which models were and were not safe, nobody would need to ask for a pause in development. All we have is the age-old split of neophilia and neophobia, of trying things and of being scared of change. We get this right, it's supper happy fun post-scarcity fully automated luxury space communism for all. We get it wrong, and there's more potential dystopias than have yet been written. |
Right now, what “safety” seems to mean in practice is, big (mostly American) corporations imposing their ethical judgements on everyone else, whether or not everyone else happens to agree with them. And I’m sceptical it is going to mean anything more than that any time soon.
If one is seriously concerned about the risk that “superintelligent AI decides to exterminate humanity”, I think this kind of “safety” actually increases that risk. Humans radically disagree on fundamental values, and that value diversity, those irreconcilable differences - from the values of the average Silicon Valley “AI safety researcher” to the values of Ali Khamenei - creates a tension which prevents any one country/institution/movement/government/party/religion/etc from “taking over the planet”. If advanced AIs have the same value diversity, they’ll have the same irreconcilable differences, which will undermine any attempt by them to coordinate against humanity. If we enforce an ethical monoculture (based on a particular dominant value system) on AIs, which is what a lot of this “safety” stuff actually about, that removes that safety protection.
It would be rather ironic if, in the name of protecting humanity from extinction, “AI safety researchers” are actually helping to bring it about