| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by raunakchowdhuri 477 days ago
	We ran some benchmarks comparing against Gemini Flash 2.0. You can find the full writeup here: https://reducto.ai/blog/lvm-ocr-accuracy-mistral-gemini A high level summary is that while this is an impressive model, it underperforms even current SOTA VLMs on document parsing and has a tendency to hallucinate with OCR, table structure, and drop content.

2 comments

shrisukhani 476 days ago

Anecdotally, we also found Gemini Flash to be better.

link

hackernewds 476 days ago

meanwhile, you're comparing it to the output of almost a trillion dollar company

link

stann 476 days ago

The tagline boasts that it is "introducing the world’s best document understanding API". So, holding them to their marketing seems fair

link

neuronic 476 days ago

Isn't anyone who releases anything putting "the world's best blablabla" on their page nowadays? I've become entirely blind to it.

link

dwedge 476 days ago

If they put it, and it's subpar, I write off the product.

link

HaZeust 476 days ago

... And? We're judging it for the merits of the technology it purports to be, not the pockets of the people that bankroll them. Probably not fair - sure, but when I pick my OCR, I want to pick SOTA. These comparisons and announcements help me find those.

link

raunakchowdhuri 476 days ago

comparisons to more outputs coming soon!

link