Hacker News new | ask | show | jobs
by denvrede 2884 days ago
Shortlist:

- Apache Tika + Tesseract for OCR of mails - i hate physical paper

- SOLR to index the output of the above mentioned data

- Imaginary (https://github.com/h2non/imaginary) for image pre-processing of the scanned mails / documents. Its much more lightweight than imagemagick.

- Openhab2 for home-automation