Hacker News new | ask | show | jobs
by Lifescape 4869 days ago
Started poking around. Here's one chunk of one of the pages at one of the higher resolutions: http://www.bl.uk/manuscripts/Proxy.ashx?view=arundel_ms_263_...

Looks like the format is:

  http://www.bl.uk/manuscripts/Proxy.ashx?view=arundel_ms_263_<page_number>_files/<resolution_level>/<column>_<row>.jpg
Pages work in this fashion: f001r, f001v, f002r, f002v, f003r, f003v, ... and ends at f283v.

At resolution 14, columns range from 0 to 33 and rows range from 0 to 24.

Edit: I hacked together a python script to download all images at resolution 14 https://gist.github.com/L1fescape/4761013

Now to figure out how to combine them all...

3 comments

You might be able to combine the JPEGs in a lossless manner if they have neat dimensions:

http://wiki.thorx.net/wiki/JPEGhack

http://jpegclub.org/jpegtran/

Now to figure out how to combine them all...

ImageMagick's montage command will probably do it: http://konrad.strack.pl/blog/image-concatenation-with-imagem...

Also tiffcp and then tiff2pdf are how I've made pdfs of such image collections in the past.

Circumventing a technical rights protection measure?

Not legal under

> European Directive 2001/29/EC of the European Parliament and of the council of May 22, 2001

and probably the DMCA.

It's not a technical rights protection measure, it's a standard bandwidth minimization measure used by sites like Gigapan and all slippy maps.