Hacker News new | ask | show | jobs
by boogies 1931 days ago
https://github.com/eihli/image-table-ocr seems to automatically find tables within larger images, IDK if it works without borders though.
1 comments

The logic for detecting a table is to get rid of everything but vertical lines over a certain length, save that in one image, then get rid of everything but horizontal lines of a certain length, save that image. Then overlay the two and take the bounding rectangle. So you don't need the table to have a border as long as you have vertical and horizontal lines and they extend far enough to encompass all the data you need.