Hacker News
by isotypic 653 days ago
Comparing Common Crawl to video makes no sense. Common Crawl is text extracted from web pages. 424 terabytes of pure text is orders of magnitude more text than I will read in my entire life.
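
For a rough sense of scale, here is a back-of-the-envelope sketch. Every input except the 424 TB figure is my own assumption (250 words per minute, about 6 bytes per word, reading 16 hours a day for 80 years), and the function name is just illustrative. Even with that deliberately generous upper bound, a lifetime of reading comes out to tens of gigabytes, not terabytes.

```python
# Back-of-the-envelope estimate; all defaults are assumptions, not measurements.
def lifetime_reading_bytes(words_per_minute=250, bytes_per_word=6,
                           hours_per_day=16, years=80):
    """Upper-bound estimate of plain text read in a lifetime, in bytes."""
    words_per_day = words_per_minute * 60 * hours_per_day
    total_words = words_per_day * 365 * years
    return total_words * bytes_per_word

CORPUS_BYTES = 424 * 10**12  # 424 TB, decimal terabytes

lifetime = lifetime_reading_bytes()
print(f"lifetime reading: {lifetime / 10**9:.1f} GB")
print(f"corpus / lifetime: {CORPUS_BYTES / lifetime:,.0f}x")
```

Under these assumptions the corpus is on the order of ten thousand lifetimes of nonstop reading, and more realistic reading habits only widen the gap.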