|
|
|
|
|
by muyuu
4741 days ago
|
|
>We’ve put a lot of work into making deletions permanent, so I can imagine how frustrated you must be It shouldn't be necessary to put a lot of work. Just delete the content, clobber it with blank fields, or randomized content. It really is very easy. The problem is that your company's policy is to never truly delete content despite calling it "delete", there are many ways to weasel-word it ("many software subsystems don't truly delete" blah blah) but when someone asks a third party to delete something (s)he doesn't ask for it to have it marked deleted. Just delete it. Surely not your fault though. PS: there's no money in this world so that I'd work for a company with the morals and values of Facebook. |
|
But for a large social site, it starts to become a more interesting distributed systems problem.
Even for a simple story with 4 likes, 12 comments, and likes on several of the comments, the data is sharded across several databases. Simple transactions no longer work. One option is looking for scalable distributed transactions. A more likely option is making sure deletions can be interrupted and restarted without losing state.
Now imagine a public figure whose post has tens of thousands of comments and hundreds of thousands likes. You may not be able to load all of those rows in a single request let alone delete them synchronously. Instead, you have to be able to process the deletion incrementally, reading and deleting some of the data then later picking up where you left off the last time you were interrupted.
Interestingly, the order of these operations can also be important. If you're interrupted between deleting a comment and deleting the likes on that comment, will you still be able to find the likes when you restart?
We've put a lot of work into a deletion framework that deals with these and many other issues. We're hoping to share more details about it in the future.