by zavertnik
892 days ago
Nice post, OP! I was super impressed with Apple's Vision framework. I used it on a personal project that involved OCRing tens of thousands of spreadsheet screenshots and ingesting the results into a Postgres database. I tried other CPU-based OCR methods such as Tesseract (since macOS and Nvidia still don't play nice together), but found the output to be incorrect too often. The Vision framework not only produced the highest-quality output I had seen, it also used the least compute. It was fairly unstable, but I can chalk that up to user error with my implementation. I used a combination of RhetTbull's vision.py (for the actual implementation) [1] and ocrmac (for experimentation) [2], and was pleasantly surprised by the performance on my i7-6700K hackintosh. I wouldn't call myself a programmer, but I can generally troubleshoot anything given enough time, and this did cost time.

[1]: https://gist.github.com/RhetTbull/1c34fc07c95733642cffcd1ac5...
[2]: https://github.com/straussmaximilian/ocrmac

Could you run a farm of macOS machines and turn this into an API for profit? Would that be legal?