Hacker News new | ask | show | jobs
by bmh_ca 3843 days ago
While interesting, and looks to be a needed services, the page leaves many questions, such as:

What's the privacy model? While the PDFs are deleted, what happens to the searchable content? Is it also deleted?

What's the revenue model? How can we be sure it'll be around in a few months?

Is there an AJAX interface?

Is the quality or performance better than running Tesseract on a server?

1 comments

Also relevant, how do we know they're not injecting a pdf exploit into the final document? There is no real company information to hold anyone accountable. There could be a dozen websites like this for people to use for "free" that will inject malicious script which most antivirus apps won't detect. Not saying it's not useful, and awesome if legit but there should be more accountability. This is almost the internet equivalent of a stranger in a car waving free candy at a child walking down the street. My workplace was hit yet again today with a Cryptolocker variant (second time this month) which required us to restore thousands of files from backup. All from clicking on a link in an email.