Hacker News new | ask | show | jobs
by nattaylor 364 days ago
The base model is Qwen2.5-VL-3B and the announcement says a limitation is "Model can suffer from hallucination"
1 comments

Seems a bit scary that the "source" text from the pdfs could actually be hallucinated.
Given that input is image and not raw pdf, its not completely unexpected