| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ccorda 1168 days ago
	Kenton Varda (tech lead) has some more notes in this twitter thread: https://twitter.com/KentonVarda/status/1659551757796515846

3 comments

rektide 1168 days ago

> Our new engine is based on intercepting SQLite's disk writes and doing clever stuff with them. It was so easy because the file format is not just well designed but amazingly well-documented.

Quite the hobby project for a lot of people too! Other folks doing this:

Rqlite https://hn.algolia.com/?query=Rqlite&sort=byDate , Dqlite https://hn.algolia.com/?query=dqlite&sort=byDate , Litestream https://hn.algolia.com/?query=Litestream&sort=byDate / LiteFS https://hn.algolia.com/?query=LiteFS&sort=byDate, marmot, mvsqlite https://hn.algolia.com/?query=mvsqlite&sort=byDate

link

otoolep 1168 days ago

I've been doing it for almost 10 years. :-)

https://www.philipotoole.com/9-years-of-open-source-database...

link

aranke 1168 days ago

Curious if you prototyped DuckDB before deciding to invest further into SQLite.

DuckDB works great as an in-memory database (it's also the default mode).

link

kentonv 1168 days ago

We didn't, no.

I'm sure there's a lot of really cool local-first databases out there, but SQLite has the benefit of being incredibly widely battle-tested, with literally billions of installations worldwide. It has received thorough security research and fuzzing (it's part of Chrome's attack surface after all). And there's tons of resources online to help people understand how to use it. Although I'm sure there are alternatives that serve certain use cases better it's hard to imagine anything coming close for ours.

That said, the storage engine we've built is not that heavily dependent on SQLite specifically. Any database that uses a write-ahead log like SQLite does should be possible to adapt to it in the future. So maybe we'll eventually open it up to a variety of choices, or even let you bring your own as a Wasm module.

link

kentonv 1168 days ago

Oh, I've been informed that DuckDB uses SQLite under the hood, so maybe compiling DuckDB to Wasm and running it on top of this will be possible, we'll see.

link

aranke 1168 days ago

From https://news.ycombinator.com/item?id=23290512:

> DuckDB is indeed a free columnar database system, but it is not entirely built on top of SQLite. It exposes the same front-end and uses components of SQLite (the shell and testing infrastructure), but the execution engine/storage code is new.

link

rektide 1168 days ago

I expect most workloads are more OLTP/transactional than OLAP/analytical.

link

elithrar 1168 days ago

Correct. DuckDB is really interesting technology, but it's not a direct successor to SQLite for transactional workloads. It's also very new: there's a LOT of new code in DuckDB on top of the (heavily fuzzed) SQLite parts.

(I use it personally, but it's not the same thing as what we're building with D1)

link

piperswe 1168 days ago

Also, for OLAP workloads there's Workers Analytics Engine. Analytics Engine is arbitrarily scalable as opposed to a D1-like solution - I can almost guarantee I've inserted more data into its internal variant (which is built on extremely similar underlying infrastructure) than just about any customer would think of doing, and it handles it like a champ.

Basically, we already have a product for doing DuckDB-like things, and the D1 architecture isn't great for high volume OLAP workloads.

Disclaimer: I'm an engineer at Cloudflare, but not on a developer platform team. I'm not speaking for our developer platform strategy or anything, I'm just commenting on how it looks from where I sit.

link

radiofmd 1168 days ago

Great link & insights from Kenton which shows once again that CF is on the edge (what a pun haha).

link