Hacker News new | ask | show | jobs
by mark_l_watson 5241 days ago
I just downloaded the dump. It is HTML and JPEG images. So, you are correct: you would need to write a script to extract what you want.