Hacker News new | ask | show | jobs
by mritchie712 410 days ago
The best open source options are Airbyte and Meltano / Singer. But it's hard to keep them running. If you self-host them, you'll hit issues at least a few times a month which can each take a few hours to solve.

It's not like running Postgres which "just works". When you self-host Airbyte, you're still building a good bit.

I felt the same way about the cost of data tools. Paying $1,000 for Fivetran, $2,000 for Snowflake, $2,000 for Looker seemed crazy. We bundle all three for $500 / month at https://www.definite.app

3 comments

Your comment reads like a pitch but I checked definite and I've been looking for something like this. What we you using it for? Did you evaluate any other Ai analytics tools?
Yes, I've looked at them all. Most AI analytics tools are doing "text to SQL", but writing SQL is a small percentage of data work.

We built an entire stack so the agent can operate across that whole stack (e.g. create pipelines, model data, build reports, etc.)

Hi, I've been loking something like this! Any of your custumer has success story migrating off bigquery to your platform? And how do you compare to motherduck? (Looks like you built some of ypur stack on top of duckdb)
Yes, we've had many bigquery / snowflake converts. The reality is, most companies don't have 100tb of data (which is what those platforms are optimized for). Motherduck has a good post[0] on this:

> There were many thousands of customers who paid less than $10 a month for storage, which is half a terabyte. Among customers who were using the service heavily, the median data storage size was much less than 100 GB.

I'm a fan of what motherduck is doing. We're building something different (opinionated, instant data stack), but yes, we both use duckdb under the hood.

0 - https://motherduck.com/blog/big-data-is-dead/

Airbyte’s not an alternative for reverse-etl though. Also your pricing page also says $1k per month.
we have a discount for startups, which would qualify many on HN