Hacker News new | ask | show | jobs
by manlio 4084 days ago
All these strategies are interesting, but I'm afraid we are over-engineering the problem here. The pretty simplistic strategy I'm using now is basically just pattern matching, and so far I had only 4 misplaced posts out of the 840 for April alone: that is < 0.5%. And it's blazing fast! I can rebuild the entire db in less then 30 seconds.

Given these number I believe pretty much everything more complicated than that would be a total overkill... Good food for thoughts though!

1 comments

I just manually curate in these cases. HN hiring threads don't ever exceed a level where 0.5% manual review would be onerous.
I think you will need 100% manual review to find those 0.5%