by xnx 79 days ago
Doesn't Gemini do this directly? I uploaded an image, asked it to identify the doors, and it gave me a JSON array of bounding boxes.
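
For anyone wanting to use that output, here is a rough sketch of turning such a JSON array into pixel coordinates. The comment doesn't show the actual response, so the shape here is an assumption: each item has a `box_2d` field holding `[ymin, xmin, ymax, xmax]` normalized to 0-1000, plus an optional `label`. The function name `boxes_to_pixels` and the sample string are made up for illustration.

```python
import json

def boxes_to_pixels(response_text, width, height):
    """Convert assumed-normalized boxes to (left, top, right, bottom) pixels.

    Assumes each item looks like {"box_2d": [ymin, xmin, ymax, xmax], "label": ...}
    with coordinates on a 0-1000 scale; adjust if the model returns another shape.
    """
    items = json.loads(response_text)
    result = []
    for item in items:
        ymin, xmin, ymax, xmax = item["box_2d"]
        result.append({
            "label": item.get("label", ""),
            "box": (
                round(xmin * width / 1000),   # left
                round(ymin * height / 1000),  # top
                round(xmax * width / 1000),   # right
                round(ymax * height / 1000),  # bottom
            ),
        })
    return result

# Hypothetical response text, not real model output.
sample = '[{"box_2d": [100, 250, 900, 500], "label": "door"}]'
doors = boxes_to_pixels(sample, width=800, height=600)
print(doors)
```

If the model wraps the JSON in a markdown code fence, that would need stripping before `json.loads`.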