Hacker News
by Mkengin 212 days ago
Interesting. So similar to the vision encoder + projector in VLMs?