by aldonius 687 days ago
msephton's sibling comment notes that Pages files are a ZIP bundle. (I just tested this, and that's the case - actually they're not even compressed, just stored.)

Inside this zip file there are a number of things:

- three preview JPGs at varying sizes (roughly 50px, 250px, 1000px)
- a Metadata folder with a couple of plists and a UUID file
- an Index folder which contains a bunch of .iwa files, which appear to be something specific to Apple's iWork suite

I'm not sure if there's a real file format there or if they all have a different format keyed on the file name. If you run `strings` on one of them you could plausibly recover document text.
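
For anyone who wants to poke at this without unzipping by hand, here is a rough Python sketch of the two checks above: listing the bundle's entries along with whether each one is merely stored, and pulling printable ASCII runs out of the .iwa files the way `strings` would. The function names and the `min_len` default are my own choices for illustration, and it assumes the document is a single ZIP file as described. If the .iwa payload turns out to be compressed or otherwise encoded, this will likely recover only fragments of the text.

```python
import re
import zipfile


def list_entries(source):
    """Return (name, is_stored, size) for each member of a ZIP archive.

    `source` is a path or a binary file-like object.
    """
    with zipfile.ZipFile(source) as zf:
        return [
            (info.filename, info.compress_type == zipfile.ZIP_STORED, info.file_size)
            for info in zf.infolist()
        ]


def printable_runs(data, min_len=4):
    """Rough stand-in for `strings`: runs of printable ASCII at least min_len long."""
    pattern = rb"[\x20-\x7e]{%d,}" % min_len
    return [run.decode("ascii") for run in re.findall(pattern, data)]


def iwa_strings(source, min_len=4):
    """Map each .iwa member name to the printable runs found in its raw bytes."""
    with zipfile.ZipFile(source) as zf:
        return {
            name: printable_runs(zf.read(name), min_len)
            for name in zf.namelist()
            if name.endswith(".iwa")
        }
```

Something like `list_entries("Document.pages")` followed by `iwa_strings("Document.pages")` (the file name is a placeholder) should show the previews, the Metadata and Index entries, and whatever text survives as plain ASCII. Note that this only looks for ASCII, so non-Latin text would need a different pattern.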