by schneehertz 661 days ago
Moreover, the classification was not done on the 500,000 PDF files themselves, but on the metadata of those PDFs.