Hacker News new | ask | show | jobs
by hdjjhhvvhga 1603 days ago
No, this would not work for a couple of reasons. Manipulating the content itself such as changing the order of words is very dangerous as it can influence the meaning, and if you process things at scale it could lead to devastating consequences. But there are many other aspects of text such as kerning and others (a dozen or so in this particular case) that are virtually invisible to the reader but are detectable by a machine. I'd prefer not to get into the details of the implementation here but of course a dedicated team with enough resources could successfully break it after some time - but I believe it wouldn't make any sense economically.
1 comments

> as kerning and others

Those can be "removed" by rendering to text and regenerating a PDF, though. Or even with print + scan + OCR.

Neither are trivial, but doable.