Hacker News new | ask | show | jobs
by cnees 207 days ago
OpenAlex has 240M. https://docs.openalex.org/api-entities/works

CORE has 431M. https://core.ac.uk/data

Crossref has 165M. https://www.crossref.org/blog/2025-public-data-file-now-avai...

These datasets are all biased towards work published in the digital age, but it's important to note that work is coming out much faster now than it used to.

2 comments

So indeed, order 10^9 not 10^8, given the CORE at > sqrt(10)*10^8.
Is that because there is a pressure to publish? As I wouldn't say we make advancements at a rate any different during the last two decades than we have over the 20 years prior to that.