Hacker News new | ask | show | jobs
by nix23 1573 days ago
>but my machine crashed

Maybe it's not the servers who throttle you then ;)

1 comments

Wget will exhaust all available ram on a long enough crawl.
No, i crawled many multi-gigabyte sites with my raspberry2 for days.
I've had memory exhaustion (on a 4GB system) after I think about 600GB in a single crawl. Splitting it into multiple crawls is of course better.

That was a site specifically set up to deal with large collections of files though.