Hacker News new | ask | show | jobs
by sanusihassan 101 days ago
The tool automatically detects whether a PDF is digitally created or scanned, prompts for document language selection when OCR is needed, and outputs a formatted Excel file with the original table structure preserved.

this is a short YouTube video demo.

Happy to answer any technical questions about the conversion pipeline or OCR implementation.