Hacker News new | ask | show | jobs
by ninetyninenine 733 days ago
It’s all about the training. Llms can have better understanding of space then humans to the point where they can draw things better than us.

Don’t restrict LLMs to text. If you train one with images and text you’ll get one that understands position.