by closeparen 3213 days ago
(It's a Hadoop distribution).
1 comment

Not a Hadoop distribution, as Databricks uses S3 and DBIO for storage, not HDFS. It doesn't even use YARN, so it can't really be called Hadoop at all.

Basically, the product is Spark notebooks (think Jupyter) on AWS that let you quickly create clusters and even do fancy stuff with Spark Streaming.

A key thing that people also miss is that the creator of Spark is the CTO of Databricks, and Databricks to some extent controls the direction of Spark. This probably affects its valuation.