|
|
|
|
|
by srajabi
1277 days ago
|
|
Great site! I feel like it's missing some things though: * Data Entry -- OCR and the like * Data Retrieval -- Don't search engines still qualify as AI * Sorting mail? * Other factory use cases like removing undesirable tomatoes: https://www.youtube.com/watch?v=aYQ_5c6m8Is * Many others I'm not thinking of... |
|
To highlight the limitations, look at an OCR'd version of a technical book with code samples and different fonts and styles that have different meanings, and that has both footnotes and endnotes. The text will be readable, but disorganized, probably inconsistent styling, and even if some footnotes and endnotes are linked by a good engine, I suspect that's less than fully reliable. For the purposes of reading the book, I'd rather have the scanned pdf with page images for reading, with the OCR'd text as the text layer for searching.
Lower-quality source images seem to cause major problems for tesseract, and even ABBYY judging from archive.org text conversions. Those engines confuse more ambiguous letter or punctuation combinations, while humans can still read the images without much trouble.