| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by aphyr 2046 days ago

> However, either they do not scale or even when they do (newsql), they are 10x slower and handle less volume, etc.

There's something to be said about CQL specifically here, because its transactional model is an odd duck, and not always for scalability reasons. Scylla and Cassandra could offer richer transactions without significant performance penalties--indeed, which would significantly speed up some types of transactional workloads, but instead there are, as the grandparent poster notes, strange edge cases.

For instance, you can't select multiple CQL rows where the select would cover a column backed by a CQL collection. That's been fixed in Cassandra, but is still present in Scylla, and limited the kinds of tests we could write for Scylla.

You also can't perform an LWT write without a guard clause: linearizable upserts in the Jepsen tests require the presence of an always-null field in every row whose only purpose is to convince Scylla that yes, we really would like to use Paxos for this write.

Likewise, there's no logical notion of a batch combining reads and writes, or, for that matter, a batch of reads. If you want those sorts of things in C* or Scylla, I imagine one winds up stringing together reads and CaS statements in retry loops, or something to that effect. We were trying to hack together a batch read via a no-op write, except that it's actually impossible to do that robustly--you can only "read" values involving guard clauses, and there's no guard which will pass on all values. It's just... awkward.

This isn't a scalability thing--all three of these cases could be executed in a single Paxos round, scoped to a single partition. It's just... weird API design.

2 comments

thekozmo 2046 days ago

It's possible to improve cql to a cql++ .. if you start going this path, it will lead to sql which isn't bad a all ;)

It's not 'fair' to compare CQL to SQL, if you compare CQL to DynamoDB's http api, which scylla implements as well, you'll see that cql is better: https://www.scylladb.com/2020/05/12/comparing-cql-and-the-dy...

link

411111111111111 2045 days ago

> It's not 'fair' to compare CQL to SQL

He didn't do that. He pointed out issues with the cql syntax and specifically scylla's implementation of it. he is also uniquely qualified to make such a statement, as his Jepsen Reports are high quality analyses looking at various databases.

But to address your other claim: they solve the same problem of providing data to your application. They should be compared if that's the decision you have to make.

link

andy_ppp 2046 days ago

You should offer an additional service - “how to stop you DB7 from being weird” - in addition to Jepsen. Thanks!

link