Hacker News
by ACCount36 347 days ago
Even the very best multimodal LLMs still suffer from a harsh perception bottleneck. They're impressive, but nowhere near as good as the human visual cortex.