Hacker News new | ask | show | jobs
by xwdv 1097 days ago
It’s fairly trivial to detect bad data, especially when you know what good data looks like. And once you’re in, they can still get useful data even if you lied.
1 comments

That's surprising. How do you trivially detect bad data? (Beyond extreme examples like below minimum wage)

Even when you know the true distribution (in which case why ask for more data) you can only remove outliers. But outliers exist.

There are techniques for when someone enters many numbers but I don't know any for 1 number.

For one, how carefully did the user input information into fields. If you're making it up, it will be much faster and less hesitant than a legitimate entry.

That and a hundred other data points help create a ML model that reliability identifies illegitimate activity.

Cool! That actually exists, more or less: https://www.nature.com/articles/s41598-018-20462-6