Hacker News new | ask | show | jobs
by xapata 3493 days ago
Or they could just special-case GitHub repos and their readme files. But what is duplicate detection if not machine learning? One can't be sure the same URL is the same document if submitted at different times.