Hacker News new | ask | show | jobs
by carbon12 4667 days ago
The problem is getting the data: a list of papers and their references/bibliography. The great thing about the arXiv is that the papers are open-access, that it is updated daily, and that the daily-update is immediately available to data-mining. Is there a similar thing for med/biomed?
2 comments

Unfortunately no. Many med/bio publications are subscription only. To do something like paperscape, you would need a massive corpus of papers, which would really cost a lot. Pubmed only has abstracts, not the citations from the paper.
Right, but there's a subset of Open Access articles from Pubmed: http://www.ncbi.nlm.nih.gov/pmc/tools/openftlist/
That's true, but coverage of seminal papers is not that great in Pubmed central from my experience. This may change now that there is a push for all publicly funded research to become open access immediately or after 1 or 2 years.
Thanks for the link. We will look into the Pubmed data and see if any of it can be included in the map.
There is always pubmed[1].

[1]: http://www.ncbi.nlm.nih.gov/pubmed/