by visarga 1771 days ago
DALL-E + CLIP models show a deep understanding of the relation between images and text.