Hacker News
by scotty79 197 days ago
Images are not that big. Each text token, once embedded, is a high-dimensional vector.

There have been recent observations that rendering text as an image and ingesting the image might actually be more efficient than using text embeddings.
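A rough back-of-envelope sketch of the size comparison, in Python. Every number here is an assumption picked for illustration (embedding width 4096, fp16 values, 1000 text tokens, a 1024x1024 RGB image), and the helper names are made up, not taken from any particular model or library.

```python
def embedding_bytes(num_tokens: int, dim: int, bytes_per_value: int = 2) -> int:
    """Bytes used by num_tokens embedding vectors of width dim (fp16 assumed)."""
    return num_tokens * dim * bytes_per_value


def raw_image_bytes(width: int, height: int, channels: int = 3) -> int:
    """Bytes used by an uncompressed image at 8 bits per channel."""
    return width * height * channels


# Assumed, illustrative values only.
text_bytes = embedding_bytes(num_tokens=1000, dim=4096)
image_bytes = raw_image_bytes(width=1024, height=1024)

print(f"1000 embedded text tokens: {text_bytes:,} bytes")
print(f"1024x1024 RGB image:       {image_bytes:,} bytes")
```

Under these assumptions the embedded text takes more memory than the raw pixels. This says nothing about how many vision tokens an image encoder would emit for that image, which is what actually decides whether the image route is cheaper for the model.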