Hacker News new | ask | show | jobs
by theSage 2336 days ago
For what it's worth, at my previous place we built a YOLO based model for detecting paragraphs/tables/headlines/page layouts mixed with traditional rule based OCR/layout detection.

https://www.youtube.com/watch?v=VVdHFqhQRUk

https://voody.clapresearch.com/