Hacker News new | ask | show | jobs
by ineedasername 2311 days ago
The Xerox OCR problem is exactly what came to mind after reading the first few sentences t of the article. And that problem happened well after the times when OCR of standard text had been considered a "difficult" problem. That said, I'm not against using this sort of development, I just think they need to be treated with skepticism and constantly evaluated. If deployed widely, some percentage of scans should always be evaluated from a QA perspective to always be vigilant of misclassification, drift, etc.