|
|
|
|
|
by quinnjh
320 days ago
|
|
I don’t think people are suggesting :
Build a renderer > build an ocr pipeline > run it on pdfs I think people are suggesting :
Use a readymade renderer > use readymade OCR pipelines/apis > run it on pdfs A colleague uses a document scanner to create a pdf of a document and sends it to you You must return the data represented in it retaining as much structure as possible How would you proceed? Return just the metadata of when the scan was made and how? Genuinely wondering |
|