| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by roywiggins 154 days ago
	> Elastic has been working on this gap. The more recent ES\|QL introduces a similar feature called lookup joins, and Elastic SQL provides a more familiar syntax (with no joins). But these are still bound by Lucene’s underlying index model. On top of that, developers now face a confusing sprawl of overlapping query syntaxes (currently: Query DSL, ES\|QL, SQL, EQL, KQL), each suited to different use cases, and with different strengths and weaknesses. I suppose we need a new rule, "Any sufficiently successful data store eventually sprouts at least one ad hoc, informally-specified, inconsistency-ridden, slow implementation of half of a relational database"

4 comments

xeraa 154 days ago

Funny argument on the query languages in hindsight, since the latest release (https://www.paradedb.com/blog/paradedb-0-20-0 but that was after this blog) just completely changed the API. To be seen how many different API versions you get if you make it to 15 years ;)

PS: I've worked at Elastic for a long time, so it is fun to see the arguments for a young product.

link

Joker_vD 153 days ago

Just as any "plain blob storage" eventually evolves a hierarchical filesystem (but with silly quirks!) on top of it.

link

deepsun 153 days ago

AFAIK, in Google it was the other way around -- their main blob storage (BigTable) is built on top of GFS (distributed filesystem).

link

Scaevolus 153 days ago

You have that backwards. GFS was replaced by Colossus ca. 2010, and largely functions as blob storage with append-only semantics for modification. BigTable is a KV store, and the row size limits (256MB) make it unsuitable for blob storage. GCS is built on top of Spanner (metadata, small files) and Colossus (bulk data storage).

But that's besides the point. When people say "RDBMS" or "filesystem" they mean the full suite of SQL queries and POSIX semantics-- neither of which you get with KV stores like BigTable or distributed storage like Colossus.

The simplest example of POSIX semantics that are rapidly discarded is the "fast folder move" operation. This is difficult to impossible to achieve when you have keys representing the full path of the file, and is generally easier to implement with hierarchical directory entries. However, many applications are absolutely fine with the semantics of "write entire file, read file, delete file", which enables huge simplifications and optimizations!

link

deepsun 152 days ago

Thank you, yes my knowledge was very outdated, waay before Spanner.

Spanner for GCS actually explains how public Google Cloud was always ACID for object listing, while S3 only implemented it around 2020. I always suspected that there must be some very hard piece to implement that AWS didn't have until 2020. Makes sense now that that piece was Spanner.

link

esafak 154 days ago

ICYMI https://en.wikipedia.org/wiki/Greenspun's_tenth_rule

link

wasting_time 153 days ago

ICYMI expands to "in case you missed it", ICYMI.

link

patates 152 days ago

I see... why am I...

link

virgil_disgr4ce 153 days ago

ICYMI expands to ... wait, shit