by silveraxe93
488 days ago
Posted 4 days ago:
> Three state of the art VLMs - Claude-3, Gemini-1.5, and GPT-4o

Literally none of those are state of the art. Academia is completely unprepared to deal with the speed at which AI develops.
This is extremely common in research papers. That's literally in the abstract. If I can see a completely wrong sentence 5 seconds into reading the paper, why should I read the rest?
Honestly, I thought Claude-3 and GPT-4o were some of the newest major models with vision support, and that models like o1 and DeepSeek were more reasoning-oriented than OCR-oriented.