by ianbutler 1691 days ago
Congrats on launching!

So how do you compare to a data catalog like DataHub? https://datahubproject.io/

From the video, you look very similar to them as a metadata consumer, and they provide extensive API integrations, so you can add basically any set of metadata you want, including Slack, Jira, etc. They're also offering a hosted version.

Their metadata is indexed into a tunable ES cluster, so you can fiddle with relevance etc. to your heart's content.

What's your big differentiator?


Thank you! Secoda is different from DataHub in a few ways:

1. If you're using the open source DataHub solution, it requires a data engineer to get the platform up and running and to maintain it, which can be fairly expensive depending on the data engineer's salary. Secoda has 15+ no-code integrations that can be set up in 5 minutes and is a fully managed solution. We are releasing a metadata API that will be available before the end of the year, in case an organization is using a product that we do not currently integrate with.

2. Acryl (the managed version of DataHub) is mainly focused on the data catalog, which they do a great job of. However, they don't provide the questions, dictionary, and visualization components that we provide in addition to the catalog. These additional components add more context around data knowledge and are focused on helping non-technical users understand company data, whereas the data catalog is focused more on helping technical data users understand company data.

3. Also, if you're using Acryl, you'll have to get in touch with their team to get a demo of the product. With Secoda, you can sign up at https://app.secoda.co and start a free trial without having to talk with our team. We do offer demos if people are interested, though.

Hey Etai and team, congrats on the launch! I’m so glad to see several teams trying to tackle this hard problem of complexity in the modern data stack.

I’m Shirshanka, the founder of the DataHub project, occasional responder to HN threads and reachable at https://slack.datahubproject.io :)

I wanted to respond to some of the text here since DataHub and Acryl Data were directly mentioned.

1. We’ve heard repeatedly from the community that the DataHub quickstart just works in 5 minutes or less (besides a current known issue with M1: thanks, Apple!). Once people are able to show value with the quickstart and the pre-packaged connectors that connect to 20+ systems, they quickly move towards a Helm-based deployment model that is open source and maintained by the Acryl team. All of this requires no code. Deploying DataHub using the provided Helm charts is also quite easy, based on what we’re hearing from the community.

2. Acryl Data is reimagining what a data catalog can do: data discovery, data observability, and federated data governance. We believe that techniques like semantic knowledge graphs are only useful and reliable if they are built on top of a live and fresh operational metadata graph. Also, we see ourselves not just as an “end user tool” but as a central fabric through which metadata is stored and transformed before being integrated into other tools. As a result, we are intentionally API-first and stream-first.

3. We already offer the open source DataHub demo at https://demo.datahubproject.io. People talk to Acryl Data after they have already tried out the open source product and are looking for a managed version that has more to offer.