Hacker News
Genie: Text to 3D in Seconds (lumalabs.ai)
62 points by tancik 967 days ago
10 comments

Oh no, not another "join our discord to use it".
It's a weird choice of UI.

Why do companies do this? Is it a development shortcut or something?

Image generation needs moderation and file storage, and Discord gives you that for free.
More importantly, it is a built-in way to show lots of generated content to a large audience, who can then thumbs-up the content they like. That signal gets fed back into the model: result X was better than result Y.
No, that's not it. Nobody's doing that on Luma and I think Midjourney put ratings on their website instead.
Using discord would save you a lot on hosting costs, especially when dealing with huge quantities of assets like images and videos.
Probably. If your small team is competent at ML model development, why spend time building a web app?

One time I participated in a small local event that required submitting photos via tagged Instagram posts. I was annoyed at that, until I realized it completely eliminated the need for any technical expertise for the organizers.

It's like Reddit but with functionality. Same benefits:

- single sign-on

- abuse protection

- has sign-on flow for later if you want to migrate

- someone else hosts assets

- existing account flow means low friction to join for existing account holders (common in the space)

- lets users see other users while they use the product

- bots are easy to spin up, and async functionality is inherent

> It's a weird choice of UI.

Yeah for sure. I guess one advantage is that since it's already in a chat, people can react and talk about it as it's happening.

Much more direct connection with users and the community can be built in the same place the tool exists.

It's annoying, but I can see why it's happening.

Both a dev shortcut as well as a stickiness play. Adding a Discord server is like installing a smartphone app.
These are decidedly not like the one on the homepage.
There's nothing on the homepage for me except a big logo and a join discord button...
The graphic on the homepage is animated, it changes shape as you drag on it with your mouse or finger. Worked fine on my phone.
The main problem you're not mentioning is lack of ability to disable notifications. There's just no way to opt out of the @everyone notifications.
Unless I'm misunderstanding, you can do that: right-click a server and, under Notification Settings, there is an option "Suppress @everyone and @here" you can toggle.

It's annoying to have to do this for every server though, I agree.

You're the hero I needed. I hope this keeps the red "1+" numbers gone.
I believe it does
Discord is aggressively horrible.
To be "that guy" I like this trend. It combines product usage w/ discussion. That said, I believe a web UI would be strictly better.
Discord as an open protocol we could all build our service around would be sick.

There would still have to be hosting, moderation controls, high availability, low latency, etc. etc. engineering that "someone" would have to handle and own (ideally decentralized / p2p), but I believe it could be solved if ever incentives could align and a will could arise.

Yes, lots of people don't like using Discord, but I can't even get it to work. Discord just tells me 'Unable to accept invite'
I don't get an error, but also nothing happens when I click accept.

edit: worked on my phone

I was also seeing this issue with Discord. Seems like taking the invite link and pasting in "Join Server" works reliably. https://share.cleanshot.com/KscggVQf
Unreasonably enjoyed the front page graphic. Played with it with right click and left click. This sort of thing reminds me of old demoscene intros to warez. Haha!
Came here to say the exact same thing. The Homepage graphic is absolutely mesmerizing.
Was there anything shared on how they achieved these results? One big hurdle for prompt-to-3D is usually the super long generation time, while they seem to do it in seconds.
Any way to try out without discord?
For those not wanting to touch Discord, here are some better examples: https://lumalabs.ai/genie?one=11eeebfc-de82-4bec-9c65-1eae57... https://lumalabs.ai/genie?one=6bc23542-c2d2-48c9-b07a-b00b0c... https://lumalabs.ai/genie?one=06d2e1bc-6290-439f-82c3-b6dab4...

and not so good examples:

https://lumalabs.ai/genie?one=15889f37-f001-43ca-bfbf-cd75d2... https://lumalabs.ai/genie?one=b7352636-2203-4ec2-9b5b-d1dd80...

In the initial preview, the models look less lumpy and overall much better. So I guess the conversion from a 3D point cloud/Gaussian splats, or whatever it uses, to 3D meshes isn't great.

Thanks. The red samurai mask is pretty good
I hate this business model.
Non-discord interfaces in the pipeline!
Discord? Pass.
Yeah a bizarre choice and instant turn off for me too.
The models don't look bad! But based on my dives into gaming, the modeling work for animating anything that moves (rigging, weight painting, retopology, creating animations, lighting) is still unsolved. This is great for static assets like boxes or things that add flavor to an environment.
Just had a play with it, and yes the topology and UV maps it spits out are rubbish. That said, I can easily use the models as concepts/guides and create new meshes over the top piece by piece. I lack artistic imagination when getting started but I do know how to sculpt and model reasonably well. This is perfect for me.
There are AI rigging tools in development (some prototypes have been released).

https://actorcore.reallusion.com/auto-rig/accurig

But the sculpts also have to be good (e.g. topology) and have to fit the rigs.

This is going to be a longer slog to solve than image generation (though I'm sure we'll get there in a few years).

Landing page is awesome, wonder how it was created
Thank you! I worked on this

We start with a point cloud generated from the model, which has color and normals, and render each particle as a little hexagon with metallic and iridescent shading. Each particle is given a position and velocity in a texture. Each frame we do a physics step that writes to these textures with fragment shaders. We sum the forces into the velocity texture:

- mouse interaction force

- flow force from time varying curl noise

- spring force to return to original position

- air resistance

Last, we do an Euler step to update the position texture. There are masses of free parameters to control forces and materials; the animation is created by changing parameters per-particle over time.
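
For anyone who wants to play with the idea, here is a minimal CPU sketch of the step described above. It is not the actual shader code: every name and constant is hypothetical, typed arrays stand in for the position and velocity textures, unit mass is assumed, and a simple analytic divergence-free field stands in for real curl noise.

```typescript
// CPU sketch of the per-frame physics step described above.
// Hypothetical names and constants; the original does this in fragment
// shaders that write to position and velocity textures.
type Vec3 = [number, number, number];

interface Particles {
  rest: Float32Array; // original xyz positions, 3 floats per particle
  pos: Float32Array;  // current positions
  vel: Float32Array;  // current velocities
}

interface Params {
  dt: number;     // time step in seconds
  spring: number; // spring stiffness toward the rest position
  drag: number;   // air resistance coefficient
  flow: number;   // flow field strength
  mouse: number;  // > 0 pushes particles apart, < 0 pulls them together
}

// Stand-in for curl noise: the curl of the potential
// (sin(y + t), sin(z + t), sin(x + t)), which is divergence-free.
function flowField(x: number, y: number, z: number, t: number): Vec3 {
  return [-Math.cos(z + t), -Math.cos(x + t), -Math.cos(y + t)];
}

function step(p: Particles, mousePos: Vec3 | null, t: number, prm: Params): void {
  const count = p.pos.length / 3;
  for (let i = 0; i < count; i++) {
    const o = 3 * i;
    const x = p.pos[o], y = p.pos[o + 1], z = p.pos[o + 2];

    // Spring force back to the rest position.
    let fx = prm.spring * (p.rest[o] - x);
    let fy = prm.spring * (p.rest[o + 1] - y);
    let fz = prm.spring * (p.rest[o + 2] - z);

    // Air resistance opposes the current velocity.
    fx -= prm.drag * p.vel[o];
    fy -= prm.drag * p.vel[o + 1];
    fz -= prm.drag * p.vel[o + 2];

    // Time-varying flow force.
    const [cx, cy, cz] = flowField(x, y, z, t);
    fx += prm.flow * cx;
    fy += prm.flow * cy;
    fz += prm.flow * cz;

    // Mouse interaction: radial force, softened so it stays finite at the cursor.
    if (mousePos !== null) {
      const dx = x - mousePos[0], dy = y - mousePos[1], dz = z - mousePos[2];
      const s = prm.mouse / (dx * dx + dy * dy + dz * dz + 0.01);
      fx += s * dx;
      fy += s * dy;
      fz += s * dz;
    }

    // Sum the forces into velocity (unit mass), then Euler-step the position.
    p.vel[o] += fx * prm.dt;
    p.vel[o + 1] += fy * prm.dt;
    p.vel[o + 2] += fz * prm.dt;
    p.pos[o] += p.vel[o] * prm.dt;
    p.pos[o + 1] += p.vel[o + 1] * prm.dt;
    p.pos[o + 2] += p.vel[o + 2] * prm.dt;
  }
}
```

Calling `step` once per frame with the elapsed time and the cursor position (or `null` when there is no interaction) gives the push/pull behaviour; varying the `Params` values over time is roughly where the animation would come from in a setup like this.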

Tech: three.js, GLSL & TypeScript

Tip: Left click to push apart, right click to pull together!

This is really awesome -- well done!
Is that sarcasm?
Probably not, there is an impressive interactive 3D point cloud rendering, which only shows up after a few seconds. Perhaps you missed that?
How many seconds? I see a discord button, and then some weird blob thing wobbling around. After 30 seconds I got bored of watching it.
30 seconds is a lot of time to stare at one thing. It sounds to me like it entranced you just as much as it did the others.

How long would you expect a single graphic to hold your attention before you can deem it impressive? Five minutes?

In that case you didn't miss anything. Weird wobbling blob thing is the "impressive interactive 3D point cloud".
You should be able to see the blob form models – perhaps there's some error when it runs on your system. It relies on floating-point textures which aren't supported everywhere

If you view this link https://lumalabs.ai/genie?s=1&d=-4, do you see the models more clearly?

If you _don't_ I'd be curious if any errors are reported in console

It's not. I was referring to the interactive 3D graphic the author explained.
Judging from other comments, it seems I'm not the only one who didn't get to experience that - perhaps I just didn't wait long enough for that to show up?

In good Hacker News style, I got downvoted into oblivion for just a simple question, but I guess that's to be expected these days.

Interesting that nothing appears for you. We should aim to show a placeholder instead in this case.

Do you get any errors in the JS console? Are you on desktop or phone?

Firefox on Windows, but I used a pretty old machine - so perhaps it was just a matter of not having waited long enough?!
Understandable, there is a bit of delay to start and yeah it is the main thing that stands out.
A funny thing this has in common with most image generation models is it can't do "upside-down".
Discord should create a product out of these sites that want to use it as an interface.