Hacker News new | ask | show | jobs
by _delirium 5196 days ago
It looks like the main difference is two-way integration: instead of just scraping data from Wikipedia dumps to produce a structured database (like Freebase and dbpedia do), it's going to store the canonical version of some of the information there, and pull from it to populate the infoboxes. One of the motivations seems to be to keep the data in sync across Wikipedia languages, so an addition or fix propagates to them all, which is currently done somewhat awkwardly by a mix of manual and bot measures.
2 comments

For the interested reader, here a cool paper on Information Arbitrage Across Multi-lingual Wikipedia: http://www.cond.org/paper_202.pdf
So they are adding an extra layer?...Who said that CS is the science where everything is solved with an extra level of indirection?
Who said that CS is the science where everything is solved with an extra level of indirection?

Looks like David Wheeler made the statement that I think you're referring to:

http://stackoverflow.com/questions/2057503/does-anybody-know...

http://en.wikipedia.org/wiki/David_Wheeler_%28computer_scien...