|
|
|
|
|
by Gormo
4694 days ago
|
|
It seems a bit too coincidental that images to which human beings assign semantic value are being transformed into images to which human beings assign different semantic value. I don't expect the scanner to have any semantic awareness of the document content, so when I hear "lossy compression", my expectation is "image may become illegible", and not "image may remain legible, but become inaccurate". |
|
The issue only involves small letters, because the compression scheme breaks up the image into patches and then tries to identify visually similar blocks and reuse them. Certain settings can allow for small blocks of text to be deemed identical, within a threshold, and thus replaced. That's all. Coincidence, not semantic awareness.
Hence the advisory notice to use a higher resolution -- smaller block sizes.