Hacker News new | ask | show | jobs
by bob1029 721 days ago
I would consider rasterizing the PDFs and then hashing the resulting bitmaps.