Hacker News
by cess11 298 days ago
It's nice. I've used it as a fallback text extraction method in an ETL flow that chugged through tens of thousands of corporate and legal PDF files.