Hacker News new | ask | show | jobs
by juretriglav 4374 days ago
It's very unlikely that commoncrawl.org will have access to full text papers, which is mostly based on expensive library/university subscriptions.

Before Scholar Ninja reaches maturity of version 1.0 though, we will be seeding the network with as many sources as we legally and technically can, with a strong focus on properly licensed open access content.