Hacker News new | ask | show | jobs
by tiddchristopher 5009 days ago
It's not at all like that. The interesting thing seems to be the claim that Craigslist posts lack creativity, which Amazon reviews certainly do have. 3Taps is claiming the posts can't be protected by copyright, because you can't copyright facts. I think their case has merit.
2 comments

My understanding was that, as the owner of a site, (even one that hosts mostly/completely content created by someone else), you get to dictate the terms on which any party accesses that site. It would be like if YouTube said that you may not access their site with anything but a web browser.

Strange that they went the copyright way, though. IANAL, but isn't accessing a service you've been explicitly forbidden to (via ToS or similar) a violation of the CFAA?

3Taps doesn't access Craigstlist so the TOS has no effect.
You are 100% wrong. The ad copy on craigslist IS copyright protected so it doesn't matter where you scraped it from. TOSs do govern the data whether or not you are viewing a cached copy or not. By your logic all TOSs are rendered useless if you browse the net via proxy servers.

You can scrape anything off the internet you like and do as you please...but you can't create a business out of it.

Site TOS don't really matter too much or else Google could never exist. They're mostly to limit the liability of the publisher and not to prevent anything with the accessor. Googlebot is not a lawyer, it can't decide the legal implications of scraping or what exact activity can be done with the data it finds.

Google also overlays facts it finds by crawling the web in maps (Google Places and the One Box results) and uses the creative contribution of other people to provide things like reviews inline on SERP. Google is a bad target to sue though because they will punch you right back in the face.

"You can scrape anything you like off the internet and do as you please... but you can't create a business out of it."

I tried to tell this to the Google guys in the 1990's but they didn't listen! :)

It is not black and white, despite your assertion. It should be tested in court.
Are 3Taps verifying by hand that every scraped post are only facts? I've seen some pretty creative CL posts. Also it's been mentioned that assemblages of facts can in fact be copyrighted. See the Farmers Almanac.
You're right about that, and I think that's going to make their claim a bit unsteady. But I see an argument they might make:

1. Classified ads are facts, with negligible creativity put into their composition. 2. Facts are not copyrightable. 3. Therefore, 3Taps can scrape the ads. 4. If any ad actually does have creativity / an applicable copyright, the copyright holder can contact 3taps with a complaint.

Basically a "safe harbor" take on the whole thing. What do you think?

Feist v. Rural (http://en.wikipedia.org/wiki/Feist_v._Rural) seems to indicate the the arrangement of the facts is copyrightable, (if there is even a minimal degree of creativity) even though the facts themselves are not. The court held the alphabetic arrangement of names is not sufficient to warrant copyright protection, and it seems logical that mere temporal arrangement of classifieds would not merit protection either. However, it seems as though 3Taps would need to find a way to rearrange the facts presented within the advertizement, because that certainly is a creative process.

The seller has the freedom to write text a description and include pictures. Photographs have a long history of copyright protection, as does written work. Simply copying the specification sheet for a particular model of TV into the ad, for example, might not be copyrightable, but how does 3Taps sort those from the rest?

I just stole this from a random Craigslist ad:

"Black futon in good condition. The mattress is a lot thicker than most futons. It is pretty easy to assemble. You must be able to pick it up though because I do not have a truck. I am moving in a week so I need to get rid of it, asking for 100 or better offer. Please use the link above to email me. "

That the object is a futon, that said futon is black, thicker than average, easy to assemble and in good condition are all facts. The seller could have presented those facts in any number of ways, but he or she chose this way (proper spelling, for example, and mostly written in complete sentences) because he or she thought that would generate a better response. The only way I can see 3Taps on solid ground is if they can take the ad text, use it to generate a set of facts, and rewrite the ad from that (analogous to the way the PC BIOS was reverse-engineered, but done mechanically (I can't imagine it would scale well to have humans rewrite the ads))

Excellent points, especially regarding photographs. I was thinking of that case already, and hadn't considered the arrangement of facts within a post; what came to mind first was the collection of posts as their own arrangement of facts.