VLLM hallucination is a blocker for my use case.
Otherwise I'd say just use your operating system's OCR API. Both Windows and MacOS have excellent APIs for this.
Otherwise I'd say just use your operating system's OCR API. Both Windows and MacOS have excellent APIs for this.