Y
Hacker News
new
|
ask
|
show
|
jobs
by
xapata
3493 days ago
Or they could just special-case GitHub repos and their readme files. But what is duplicate detection if not machine learning? One can't be sure the same URL is the same document if submitted at different times.