|
|
|
|
|
by jamescampbell
1623 days ago
|
|
I built something similar in the past using TesseractOCR and Apache Tika and PyPDF2 / QPDF. The idea is sound. An API based OCR already exists in Apple / Microsoft / and Google so I am not sure this would be that useful. There would be no way for the user to trust that you are not taking the data you are OCR'ing and using it. If you can apply some type of one way encryption of the content and prove it via open source code (like Whisper Systems does for Signal) which seems like overkill and lots of effort for a free app. |
|
Where can I reach them? Thx.