Hacker News new | ask | show | jobs
by soxpopuli 4088 days ago
But the linked page is there. Google is indexing Amazon, extracting the photo, description, and price of the product, displaying it in a box, and making it a link back to Amazon where it got the data from, what's the problem?

Product Search is just another form of summarization/snippeting that just presents the data in more digestable format.

Remember Google Fusion Tables? That was an attempt to extract facts from pages and put them into tables, so if you ask "What's the masses of the planets of the solar system", you could get a table of 8 planets and masses, with the results coming from 8 different sites. But the links could still be there to the original site, it's just formatted as a table instead of as 8 blue links with summary paragraphs, which is harder for humans to process.

Where do we draw the line? You've seen how Google has a new system that can automatically caption images with deep neural networks. (http://techcrunch.com/2014/11/18/new-google-research-project...)

Now what if this same system eventually allows the search engine to summarize your web page by 'reading it', and then auto-generating a paragraph that explains what it thought it was about?

There'd be no actual direct copying of text (like there is with search snippets), instead it would be more like a human going to a library, reading a book, and writing a review

Would this also violate copyright?