Hacker News new | ask | show | jobs
by brudgers 2850 days ago
Maybe Apache Tika?

https://tika.apache.org/