Hacker News new | ask | show | jobs
by hinnisdael 1003 days ago
Aren‘t you losing information in the parts that aren‘t perfectly straight? Yes, you can stretch those to recreate the original layout, but that would come at the cost of resolution in the interpolated sections of the page. Granted, not a problem for most books, but probably a reason prople are still looking for mechanical solutions to the problem.
2 comments

I think the suggestion is that with AI you can interpolate to the actual letterforms, not to pixels.

Working a typical volume the letter “e” will appear hundreds of times and be identical, so there should be lots of data to help resolve ambiguities in the poorer parts of images.

Not to mention data that can be used across volumes.

If the goal is to ultimately OCR then it's moot. But yes, of course information is lost.

That being said, modern phone cameras are going to produce "scans" above 300 DPI, and while 600 DPI or higher might be tricky they're stills possible if you take partial shots of a document, assuming you can focus that close.

What you lose in quality you make up with convenience, I suppose.