| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ComodoHacker 3626 days ago
	What OCR library do you use? What languages it supports?

1 comments

For scanned images we use https://github.com/tesseract-ocr/tesseract. For text based PDFs we pull the text directly from the file and all languages are supported.