|
|
|
|
|
by sourc3
3963 days ago
|
|
I have been working on a side project that needs to read dynamic table layouts and extract financial information. I was excited to hear about Tabula a few weeks ago but I had 0 success in getting even one PDF extracted. I ended up using pdfquery package in python which heavily utilized PDFMiner under the covers. Besides ABBYY soft (which is proprietary, licensed), does anyone have other recommendations? |
|