Hacker News new | ask | show | jobs
by acover 1097 days ago
That's surprising. How do you trivially detect bad data? (Beyond extreme examples like below minimum wage)

Even when you know the true distribution (in which case why ask for more data) you can only remove outliers. But outliers exist.

There are techniques for when someone enters many numbers but I don't know any for 1 number.

1 comments

For one, how carefully did the user input information into fields. If you're making it up, it will be much faster and less hesitant than a legitimate entry.

That and a hundred other data points help create a ML model that reliability identifies illegitimate activity.

Cool! That actually exists, more or less: https://www.nature.com/articles/s41598-018-20462-6