| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by hdjjhhvvhga 1603 days ago
	No, this would not work for a couple of reasons. Manipulating the content itself such as changing the order of words is very dangerous as it can influence the meaning, and if you process things at scale it could lead to devastating consequences. But there are many other aspects of text such as kerning and others (a dozen or so in this particular case) that are virtually invisible to the reader but are detectable by a machine. I'd prefer not to get into the details of the implementation here but of course a dedicated team with enough resources could successfully break it after some time - but I believe it wouldn't make any sense economically.

1 comments

berkes 1598 days ago

> as kerning and others

Those can be "removed" by rendering to text and regenerating a PDF, though. Or even with print + scan + OCR.

Neither are trivial, but doable.