Hacker News new | ask | show | jobs
by schappim 1098 days ago
Sub 2 second generations on cell phones, nice! Better FID and CLIP scores than Stable Diffusion v1.5 with 50 steps, great!

So are they gonna release the code, or do they only open-source ad-SDKs[1]?

[1] https://github.com/orgs/Snapchat/repositories

1 comments

I'll believe thier hypothesis when I can run the open source code on my iPhone in 2 seconds, doubt it's that fast.
That demo doesn’t really do anything for me to know if it’s on device or not.

Without source code it could just be calling the OpenAI api.

The phone is in airplane mode
The issue is that it’s a controlled video from the author. I can still get Wi-Fi and Bluetooth in airplane mode so the airplane mode sign isn’t “proof” enough for me to accept the paper conclusions.

I hope it’s real, but posting a YouTube to assuage the “show me the implementation” isn’t going to help with my cynicism.

They clearly are not connected to WiFi. You’re concocting a pretty far fetched theory at this point.
The paper describes the implementation including a detailed breakdown of the optimisation algorithm itself. It's also plausible an iPhone 14 Pro could do it given its memory b/w, ops/s and that it can fit the SD model in RAM.
How do we know the video isn't ai generated? (fnord)
In that video it takes 5 seconds.