Hacker News new | ask | show | jobs
by jvictor118 421 days ago
i've found the vision capabilities are very bad with spatial awareness/reasoning. They seem to know that certain things are in the image, but not where they are relative to each other, their relative sizes, etc.