by throwaway7783 384 days ago
I'm biased, but this is yet another thing my team has to build and maintain now.

I am not in the business of building and maintaining integrations; I'm trying to derive and surface signals that are useful to my business from all the data (often crappy data) that's generated. Even just dumping data into a database is not sufficient, because it's not clean, not unified with other data, highly duplicative in many cases, and so on.

I'm glad these options exist though, for teams that thrive on building everything themselves.


Totally understand - you want clean, unified data for business insights, not another integration tool to maintain. Sequor actually grew out of our Master Data Management (MDM) work where data cleaning and deduplication are core challenges. We focused on API integration for this release, but have mature data cleaning/deduplication components that we plan to open source as well. What specific data quality issues are you dealing with? Happy to share what we've learned from MDM projects and our data quality engine.
That's exactly where I landed too. We didn't need a modern data stack managed by a data team, where everything is coded with significant turnaround times. We ended up using an MDM (Syncari).

MDMs are unsexy and carry a lot of baggage, with a market full of expensive legacy vendors. But the principles are sound, and the more modern platforms have turned out to be pretty good.