|
|
|
|
|
by mytddu
3600 days ago
|
|
My gripe with PDF is that it's the standard format for academic publishing, rendering a whole mass of scientific knowledge largely inaccessible for text processing purposes. I've wanted to analyze the Libgen archive of journal articles for a long time but have never found an adequate solution for extracting text from PDFs. Any suggestions on this? |
|