Hacker News new | ask | show | jobs
by kresten 2351 days ago
Apache Tika is a powerful text extraction engine.

Why this over Tika?

1 comments

That's a really good question! I've been using for quite some time Tika as the Swiss-army knife for text extraction.

They don't seem to be even using Tika behind the hood as any of the bundled tools. Perhaps anyone has some comparisons?