by echelon 1389 days ago
Google hasn't released squat.

Google's product is vaporware and we shouldn't afford them any airtime until they release something usable. They're just trying to butt in and get press off of the backs of the teams actually working in the open, and that's super lame.

Release your model, Google, or stop bragging and talking over the others here. You're greedily sucking oxygen out of the conversation, and as a trillion dollar monopoly you don't deserve anything for free off of the backs of others. Not when you're not contributing. Stop being the rich kid talking over everyone else about how awesome your toys are.

Anyhow, the real story is Stable Diffusion. They're actively demonstrating the correct way to run this as opposed to the entirely closed OpenAI DALL-E or the (again vaporware) Google non-product.

Even MidJourney uses Stable Diffusion under the hood, using sophisticated prompt engineering to make their product distinct and powerful.

5 comments

I feel there's a strong argument to be made that these organizations should be required to release these models publicly. These are built on the works of the public at large, and the public should get the full benefit of them.

Whatever effort Google has put into building the model is infinitesimally small compared to the work of the creators they're harvesting.

I don't expect this to happen easily, if at all, but I'm strongly in favor of it, and would even support legislation to that effect.

They are afraid of being sued because they are using all the images they have scraped from every website ever created. They are probably even using images that are not publicly available.
Well… MidJourney used Stable Diffusion (with an additional guidance model, I believe, not just prompt engineering) for their beta model, which they have already closed down again… it’s back to their old, far inferior model.
Why did they close it down?
The rumours are that it was too good at generating nudity for their comfort, and in particular that some users may have combined that with younger subjects.
I kind of get the sentiment about openness but I think it's way more nuanced than you are making out.

There are very good reasons for withholding SOTA models, primarily from the info hazard angle and avoiding escalating the capabilities race which is basically the biggest risk we have right now.

Google / DeepMind have actually made some good decisions to try to slow down the race (such as waiting to publish).

They're not slowing down anything. The cat's out of the bag.

What good does a few months lag do when nobody is bracing for impact?

I'm not saying they are doing a good enough job, but that doesn't mean their approach is entirely without merit.

Even ignoring the infohazard angle, if they published everything immediately, that would escalate the race. By sitting on their capabilities and waiting for others to publish first (e.g. PaLM, Imagen vs GPT-3, DALL-E), they are at least only playing catch-up.

Capabilities race, seriously? This is not nuclear warfare my guy. It's mathematics.
Nuclear warfare is much less concerning than misaligned AI.

Take a look into scaling laws and alignment concerns. This is a very real challenge and an existential risk, not some crackpot theory.

In the same sense that deep learning is just linear regression with a steroid problem.
Information warfare is pretty dangerous too!
Can you talk more about the prompt augmentation that MidJourney is doing behind the scenes? It's certainly true that you can put in a two-word phrase like "Time travelers" and get an amazing result back, which reveals just how much your prompt is getting dropped into a prompt soup that also gives it that MidJourney look by default.