Hacker News new | ask | show | jobs
by boarnoah 1357 days ago
There is a case to be made, not just for natural language but code.

AFAIK there is quite a bit of examples from security labs where malware authors aren't necessarily identified but at least fingerprinted based on naming conventions, patterns they use across multiple projects etc...

That sort of fingerprinting could expand to correlating someone's anonymous software projects to other examples of code elsewhere (ex: if they contribute to source available stuff).

re: the example project you mention specifically, it does feel like using tools like that almost as a linter for natural language would be a fingerprint in itself.

EDIT: As far as OPSEC goes, a fun tidbit. A friend of mine identified a PR I submitted anonymously to them, simply because of the style of PR comments I made.

1 comments

I suspect this is why TrueCrypt shut down.

A paper got released in 2015 that claimed 94% accuracy in identifying authors.

I’m sure it would have been quite easy for the NSA to figure out who Satoshi Nakamoto was too considering PRISM was also scooping up everyone’s email.