Hacker News new | ask | show | jobs
by thewarrior 1154 days ago
At a high level yes.

More precisely - It gets the question After irs passed through a matrix that transforms the text description of the image so the LLM can “understand” it.

It maps from the space of one ML model to the other.