Hacker News new | ask | show | jobs
by jewbacca 3296 days ago
At least Google and Twitter have data takeout.

I recently discovered that, on Reddit, anything beyond your more recent 1000 posts/comments/upvotes is totally irrecoverable to you, even via scraping.

1 comments

Wait, really?
Yeah. This was pretty upsetting to discover. I had been blindly using my reddit upvote history as a supplementary personal log of sorts, for many years. And most of that's now just gone.

Thank god I haven't made over 1000 comments or posts with any one account.

The data's all still in the database, but due to their caching setup, only the last 1000 of anything is publicly indexed. While everything's technically reachable, it's all deep web. To recover something private like upvoted or saved posts, we're talking heat-death-of-the-universe, through a full-table-scan squeezed through brute-forcing a search box, while authenticated, with rate limits.