Hacker News new | ask | show | jobs
by hovering_nox 746 days ago
I think CogVLM2 is even better than Intern at OCR (my usecase is extracting information from an invoice)
1 comments

After some superficial testing I with bad quality scans you can find on kaggle I can not confirm that. CogVLM2 refuses to handle scans that InternVL-V1.5 still can comprehend.