| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by xwdv 1097 days ago
	It’s fairly trivial to detect bad data, especially when you know what good data looks like. And once you’re in, they can still get useful data even if you lied.

1 comments

acover 1097 days ago

That's surprising. How do you trivially detect bad data? (Beyond extreme examples like below minimum wage)

Even when you know the true distribution (in which case why ask for more data) you can only remove outliers. But outliers exist.

There are techniques for when someone enters many numbers but I don't know any for 1 number.

link

flangola7 1097 days ago

For one, how carefully did the user input information into fields. If you're making it up, it will be much faster and less hesitant than a legitimate entry.

That and a hundred other data points help create a ML model that reliability identifies illegitimate activity.

link

acover 1097 days ago

Cool! That actually exists, more or less: https://www.nature.com/articles/s41598-018-20462-6

link