I assume they were hand-scraped. (Actually, you could probably farm this out on Mechanical Turk if you needed a larger dataset.)
The link in your blog still redirects to Google Drive :)