Y
Hacker News
new
|
ask
|
show
|
jobs
by
joshmn
3986 days ago
Hmm. Now I'm thinking that I might end up using your idea (scraping the dark web) and using something like httrack[0] to do exactly that: structure.
[0]
https://en.wikipedia.org/wiki/HTTrack
1 comments
gwern
3986 days ago
I once tried using HTTrack, but I found it was doing too much magic under the hood and was hard to work with. As dumb as wget is (that blacklist bug is over 12 years old now!), it at least is understandable.
link
joshmn
3986 days ago
Thanks for saving me the headache :)
link