Hacker News new | ask | show | jobs
by microo8 1688 days ago
I don't have very much of the paperwork. Just wanted to organize our home documents. So I have a script that gets the image from scanner, then crops, corrects skew and runs tesseract on it. Then saves it as pdf with text.

Then I've written a shitty tool to make a fulltext index of all the pdfs in the current directory https://git.sr.ht/~ghost08/pdfq (It doesn't have documentation or even a README so ...)

And then I search like so: `cd my/documents/path && pdfq index && pdfq search "my query"`, it just prints the file names which match the query.

Also for backup I use syncthing, so I have 3 copies of all my documents on 3 computers in my house.