Hacker News new | ask | show | jobs
by gryn 1830 days ago
allenai seems to be working on something like that for pdf files.

https://github.com/allenai/pawls