Hacker News new | ask | show | jobs
by kevinwang 2469 days ago
Open AI did the right thing by not releasing their model; it's disappointing that researchers are so callous about the potential effects of their research in the name of progress.
3 comments

I've never really gotten why AI types are so concerned about text-generation models.

Like, sure, I can kind of see why you wouldn't want to make the Deepfakes program public; it currently takes a lot of time, effort, and expertise to swap faces realistically in a video, and maybe we don't want to give every average Joe the ability to do that.

But pretty much everyone in the world can already pretty trivially write text. (I'm doing it right now!) And the "typical" generation output from these programs usually isn't very good—OpenAI had to try like thirty times for each of the prompts in their PR materials—so it usually ends up being less work to just write the fake news yourself instead of using the software.

My personal conspiracy theory is that all this talk of "the model is too dangerous to release" really boils down to "if we let people test out the model, they'll find it doesn't work as well as our PR team wants them to think it does".

I dunno, this time the text looks really good. I got as far as 5 or 6 phrases deep before it said anything silly. I would have been fooled if I red it in real life.

My guess is that they will perfect the transformer and its training process, curate the dataset and make this method really easy to use. Maybe it can do translation, math, even auto-complete code. That is only by iterating more on the current formulation of the Transformer.

But it is also possible that it is surpassed by something even better. This new language model could replace the inductive bias specific to the Transformer - the ability to "attend" to any part of the input text, with something more efficient, because Transformers are quite hard and expensive to train right now. Maybe the Transformer inductive bias is too general (like a fully connected network) and needs too much data, with a slightly different idea it could be made much more efficient and probably more convincing.

I sort of agree with the principle -- for instance, releasing easy-to-use pretrained SOTA models for "deepfakes" type tasks can cause real harm. There are existing repositories out there which are already morally dubious in my view -- if researchers from a large organization made a big advance and released the code, that would seem very irresponsible.

But I just don't buy that there's significant danger in the public having access to a generative language model, at their current level of quality.

It doesn't seem like this team was callous -- they seem to have honestly thought about potential problems before deciding to release it.

the reason openai didn't release their model was just marketing and using the AI hypetrain. None of these text generating models is ever going to fool anyone any more than a markov chain generator, as they have no grasp on the actual meaning of the text they're generating.

The openAI model produced sentences along the lines of "before the first human ever walked on earth, humans did such and such". Hiring workers in a developing country to write you propaganda is cheaper than training that model.