Hacker News new | ask | show | jobs
by gregwebs 1042 days ago
Replibyte also does this and supports anonymization: https://www.replibyte.com/docs/introduction

You might want to combine a subsetting tool with an anonymizing tool like this one:

https://datanymizer.github.io/docs/

1 comments

"Replibyte" is a toy. This thing operates seriously on database dumps - a text file full of "insert" statements (what DBMS often generates like this, but no guarantee)
Do you mean it is a toy because it only works on database dumps or..? I can imagine that querying, filtering and traversing relations will be harder with only a textual dump.
It operates on database dumps. That alone is absurd. Database dumps are not public interfaces. And they are really not suitable for mass data processing.

You have a DBMS available, but all you ask from it is a dump of all data in a format that can only be used as an archive or transfer format. And yet you think you can process this data reasonably?

> would you expect to be congratulated on your unusually high IQ?

I guess your "you" is not directed at me, as I basically wondered the same.

Replibyte seems like to use a highly unpractical way as I explained, but I meant to ask wether you only found the implementation to be bad, or that you also found it lacking in other terms.