Hacker News new | ask | show | jobs
by Eisenstein 480 days ago
If you just want to play with using a vision model to do OCR, I made a little script that uses KoboldCpp to do it locally.

* https://github.com/jabberjabberjabber/LLMOCR