Hacker News new | ask | show | jobs
by dmezzetti 880 days ago
One additional library to add, if you're working with scientific papers: https://github.com/kermitt2/grobid. I use this with paperetl (https://github.com/neuml/paperetl).