Hacker News new | ask | show | jobs
by Avicebron 1115 days ago
apropo username, having taken a crack at pulling relevant information out of scanned documents I agree that scan quality is very important (while often lengthy and expensive) especially if someone is trying to derive meaningful information from a digital copy without the physical copy to do a comparison with.

And from the look of the picture those books are massive and probably very delicate.

EDIT: to add a bit to the expensive part of this, it's expensive even with the willingness and resources to get it done, it's hard but unfortunately to even convince someone to dedicate these resources is a hurdle.

1 comments

Ah, baloney. If you can open the book, you can photograph it with your iphone. You'll find the result answers your concerns. Try it with any of your books.
That reminds me, I have an out-of-copyright book by a namesake where I took the photos years ago — before I had a smartphone let alone one with built-in OCR — and still have not gotten around to transferring the text to wiki… source? wikibooks? One of them.

I should do that.