|
|
|
|
|
by cl1nk
951 days ago
|
|
You are not, and this demo is total fail. Not only the cube didn't became a sphere, the AI took ages to reply, the instructions were wrong and the result was a total mess. I assume most people in these comments don't understand 3d modeling, or they are seriously optimistic about THE IDEA of vision assistant AIs, but this demo is not exciting at all. In fact is detrimental to showcase real utility |
|
This is just an early taste of a potentially powerful use case.
I understand the vision API doesn’t have memory, so each screenshot it takes is like an entire new context. If the script/application is able to send WHAT application it’s in, and has some RAG database in the backend to pull knowledge from, this would be incredibly useful.
Of course it’s slow now. If you’re legitimately stuck, a couple seconds for a personalized answer is a perfect trade off. It will get better.