Hacker News new | ask | show | jobs
by ternaryoperator 2345 days ago
I'm pretty sure Apache Tika [0] will do what you need (and lots more).

[0] https://tika.apache.org/

1 comments

Do you happen to know of any alternatives not written in Java?