Hacker News new | ask | show | jobs
by opto 438 days ago
Looks like a great project, and one sorely needed by people like me who find themselves trying to get hold of old books they can't get in their local library and that are too expensive to buy secondhand.
3 comments

As far as I know Standard gets their raw ebooks from Project Gutenberg which has a vastly greater collection of public domain works. What they're doing is typesetting them for the average reader. But if all you're looking for is just the content, Gutenberg is the place to look for ethically clean copies.
The shadow libraries such as Anna's Archive are a treasure trove of old books, and you're not breaking any imaginary law by downloading old books which are out of copyright.
If a book is out of copyright you can usually find the scan on Internet Archive. No need to look elsewhere at all.
The internet archive's open library will also link to Standard Ebooks (and Gutenberg and a few others) if a version exists of a book you are looking at e.g.:

https://openlibrary.org/books/OL37044523M/The_Woodlanders

If a book is still in copyright, chances are you’ll find it there as well.

Scans suck though, even a badly OCR’ed EPUB is way better.

The scans can have a different copyright date than the book itself.
There is no copyright on scans.

Scanning is not transformative and does not result in a derivative work which can is protected by copyright law.

https://en.wikipedia.org/wiki/Wikipedia:Scanning_an_image_do...

https://law.stackexchange.com/questions/1214/who-owns-a-copy... points us to read the Compendium of US Copyright Office Practices at https://www.copyright.gov/comp3/docs/compendium.pdf

> 313.4(A) Mere Copies

> A work that is a mere copy of another work of authorship is not copyrightable. The Office cannot register a work that has been merely copied from another work of authorship without any additional original authorship. See L. Batlin & Son, 536 F.2d at 490 (“one who has slavishly or mechanically copied from others may not claim to be an author”); Bridgeman Art Library, Ltd. v. Corel Corp., 36 F. Supp. 2d 191, 195 (S.D.N.Y. 1999) (“exact photographic copies of public domain works of art would not be copyrightable under United States law because they are not original”).

A pdf file can contain more than just the raw images of the pages.
Certainly! If you add my latest Kirk/Spock slash fanfic to the end of the text, then that is transformative, so the resulting PDF is covered under copyright.

But you wrote "scan". Adding an OCR'ed text layer, or doing manual proofreading and layout ("sweat of the brow") is not sufficiently transformative to have copyright protection.

And we were specifically talking about scans of old books stored in shadow libraries.

Tracking down older or out-of-print books can be weirdly frustrating, especially when prices for secondhand copies get absurd