by cl1nk 951 days ago
You are not, and this demo is a total failure. Not only did the cube not become a sphere, the AI took ages to reply, the instructions were wrong, and the result was a total mess.

I assume most people in these comments don't understand 3D modeling, or they are seriously optimistic about THE IDEA of vision assistant AIs, but this demo is not exciting at all. In fact, it is detrimental to showcasing real utility.

2 comments

Classic HN response.

This is just an early taste of a potentially powerful use case.

I understand the vision API doesn’t have memory, so each screenshot it takes is like an entirely new context. If the script/application is able to send WHAT application it’s in, and has some RAG database in the backend to pull knowledge from, this would be incredibly useful.
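
To make that concrete, here is a rough sketch of what I mean, not how the demo actually works. Everything in it is made up for illustration: the `NOTES` list stands in for a real RAG backend, the app names and note texts are placeholders, and `build_request` just assembles a dict instead of calling any real vision API. The point is only that each request carries the app name and the retrieved notes itself, since nothing is remembered between calls.

```python
from dataclasses import dataclass


@dataclass
class Snippet:
    app: str   # application the note applies to
    text: str  # the note itself


# Hypothetical stand-in for a RAG backend: a tiny in-memory list of notes.
NOTES = [
    Snippet("modeler", "placeholder note: turning a cube into a sphere"),
    Snippet("modeler", "placeholder note: moving the camera"),
    Snippet("editor", "placeholder note: turning on word wrap"),
]


def retrieve(app: str, question: str, k: int = 2) -> list[str]:
    """Return up to k notes for `app`, ranked by word overlap with the question."""
    words = set(question.lower().split())
    scored = []
    for note in NOTES:
        if note.app != app:
            continue  # only consider notes for the application in use
        overlap = len(words & set(note.text.lower().split()))
        if overlap > 0:
            scored.append((overlap, note.text))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:k]]


def build_request(app: str, question: str, screenshot: bytes) -> dict:
    """Pack the app name, question, and retrieved notes into one stateless request."""
    return {
        "app": app,
        "question": question,
        "context": retrieve(app, question),
        "screenshot_size": len(screenshot),  # a real call would attach the image
    }


request = build_request("modeler", "how do I make a cube into a sphere", b"\x89PNG")
```

A real setup would presumably swap the word-overlap ranking for embedding search, but the shape of the request stays the same.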

Of course it’s slow now. If you’re legitimately stuck, a couple of seconds for a personalized answer is a perfect trade-off. It will get better.

I think every UI application should start logging the actions the user takes so that AI could learn the mappings from actions to visual output. It would be an amazing form of data.
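
For what it's worth, the logging side could be as small as the sketch below. This is a guess at one possible shape, not anything an existing app does: `ActionLog`, the action strings, and the record fields are all hypothetical, and frames are treated as raw bytes that get hashed rather than stored.

```python
import hashlib
import json
import time


class ActionLog:
    """Hypothetical logger pairing each user action with the screen before and after it."""

    def __init__(self):
        self.records = []

    @staticmethod
    def _digest(frame: bytes) -> str:
        # Short content hash so frames can be matched without storing pixels here.
        return hashlib.sha256(frame).hexdigest()[:12]

    def record(self, action: str, before: bytes, after: bytes) -> dict:
        entry = {
            "time": time.time(),
            "action": action,
            "before": self._digest(before),
            "after": self._digest(after),
            "changed": before != after,  # did the action alter the visual output?
        }
        self.records.append(entry)
        return entry

    def to_jsonl(self) -> str:
        # One JSON object per line, a common format for training data dumps.
        return "\n".join(json.dumps(r) for r in self.records)


log = ActionLog()
first = log.record("click:subdivide", b"frame0", b"frame1")
second = log.record("hover:menu", b"frame1", b"frame1")
```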
I could say your comment is a classic 2023 HN comment? There is no reason to be overly optimistic about other people’s products. Plus, nobody said “oh wow this will never work”; it’s just currently quite bad.
I couldn’t hear it perfectly, but I’m pretty sure the instructions it provided were to transform the vertices of the cube to make the sphere. It’s like using MS Frontpage. It may look right, but it’s a convoluted mess underneath.
Have mercy on him. Remember, this is the worst version it will ever be.